|
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ |
|
| build | chrom | start | end | gene_id | gene_name | gene_biotype | go_id | go_label | |
|
================================================================================================================================================================================================================================================ |
|
| "GRCh38" | "1" | "20740266"^^<http://www.w3.org/2001/XMLSchema#int> | "20787323"^^<http://www.w3.org/2001/XMLSchema#int> | "ENSG00000127483" | "HP1BP3" | "protein_coding" | "GO:0006334" | "nucleosome assembly" | |
|
| "GRCh38" | "1" | "20740266"^^<http://www.w3.org/2001/XMLSchema#int> | "20787323"^^<http://www.w3.org/2001/XMLSchema#int> | "ENSG00000127483" | "HP1BP3" | "protein_coding" | "GO:0000786" | "nucleosome" | |
|
| "GRCh38" | "1" | "20740266"^^<http://www.w3.org/2001/XMLSchema#int> | "20787323"^^<http://www.w3.org/2001/XMLSchema#int> | "ENSG00000127483" | "HP1BP3" | "protein_coding" | "GO:0005634" | "nucleus" | |
|
| "GRCh38" | "1" | "20740266"^^<http://www.w3.org/2001/XMLSchema#int> | "20787323"^^<http://www.w3.org/2001/XMLSchema#int> | "ENSG00000127483" | "HP1BP3" | "protein_coding" | "GO:0070828" | "heterochromatin organization" | |
|
| "GRCh38" | "1" | "20740266"^^<http://www.w3.org/2001/XMLSchema#int> | "20787323"^^<http://www.w3.org/2001/XMLSchema#int> | "ENSG00000127483" | "HP1BP3" | "protein_coding" | "GO:0031491" | "nucleosome binding" | |
|
| "GRCh38" | "1" | "20740266"^^<http://www.w3.org/2001/XMLSchema#int> | "20787323"^^<http://www.w3.org/2001/XMLSchema#int> | "ENSG00000127483" | "HP1BP3" | "protein_coding" | "GO:0006355" | "regulation of DNA-templated transcription" | |
|
| "GRCh38" | "1" | "20740266"^^<http://www.w3.org/2001/XMLSchema#int> | "20787323"^^<http://www.w3.org/2001/XMLSchema#int> | "ENSG00000127483" | "HP1BP3" | "protein_coding" | "GO:0003677" | "DNA binding" | |
|
| "GRCh38" | "1" | "20740266"^^<http://www.w3.org/2001/XMLSchema#int> | "20787323"^^<http://www.w3.org/2001/XMLSchema#int> | "ENSG00000127483" | "HP1BP3" | "protein_coding" | "GO:0005694" | "chromosome" | |
|
| "GRCh38" | "1" | "20740266"^^<http://www.w3.org/2001/XMLSchema#int> | "20787323"^^<http://www.w3.org/2001/XMLSchema#int> | "ENSG00000127483" | "HP1BP3" | "protein_coding" | "GO:0016607" | "nuclear speck" | |
|
| "GRCh38" | "1" | "20740266"^^<http://www.w3.org/2001/XMLSchema#int> | "20787323"^^<http://www.w3.org/2001/XMLSchema#int> | "ENSG00000127483" | "HP1BP3" | "protein_coding" | "GO:0042127" | "regulation of cell population proliferation" | |
|
| "GRCh38" | "1" | "20740266"^^<http://www.w3.org/2001/XMLSchema#int> | "20787323"^^<http://www.w3.org/2001/XMLSchema#int> | "ENSG00000127483" | "HP1BP3" | "protein_coding" | "GO:0005515" | "protein binding" | |
|
| "GRCh38" | "1" | "20740266"^^<http://www.w3.org/2001/XMLSchema#int> | "20787323"^^<http://www.w3.org/2001/XMLSchema#int> | "ENSG00000127483" | "HP1BP3" | "protein_coding" | "GO:0097298" | "regulation of nucleus size" | |
|
| "GRCh38" | "1" | "20740266"^^<http://www.w3.org/2001/XMLSchema#int> | "20787323"^^<http://www.w3.org/2001/XMLSchema#int> | "ENSG00000127483" | "HP1BP3" | "protein_coding" | "GO:0071456" | "cellular response to hypoxia" | |
|
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ |
Thank you so much for the detailed guidance! Just a quick query - would I be running these scripts in a bash environment to replicate the results?
Also, regarding the integration site query, will this process be able to identify the regulatory role of the gene, such as whether the site falls within a promoter or enhancer region?
yeah, I used sparql for fun but i you don't know them, you should use tools like
bedtools intersect
andjoin
....I have been using Linux environment but fairly new, still I'm eager to give them a try. Just to confirm, should I run the
awk
command you provided first like this:Followed by executing the Makefile with:
I'm not quite sure how to proceed with executing the
query.01.sparql
afterward. Could you please provide guidance on this? Please correct me if I'm wrong. Appreciate your help.just