Hi:
I have been analyzing RNA-seq data, in the output file the gene coordinates is listed as format like "CHR_MG117_PATCH:108783796-108806337"
What does this mean? How can I get information on this gene?
Thanks,
Hi:
I have been analyzing RNA-seq data, in the output file the gene coordinates is listed as format like "CHR_MG117_PATCH:108783796-108806337"
What does this mean? How can I get information on this gene?
Thanks,
CHR_MG117_PATCH:108783796-108806337
happens to be from mouse genome. Entire patch is available here.
Genome patches are described in this FAQ section on Genome Research Consortium site.
I am assuming you are working on the human genome.
The human reference genome is still a work in progress, so some areas need improvements, or you can have alternative sequences of the same part of the genome.
Instead of waiting for the next release of the human reference, the Genome Reference Consortium chooses to release patches / fixes for these small regions. Hence the "weird" names, for example your sequence correspond to a patch where the primary assembly was found to be incorrect.
If you want more information about this: http://www.ensembl.info/2011/05/20/accessing-non-reference-sequences-in-human/
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Thank you for your reply! I understand now