10.7 years ago
4evo.11.ove4
I know RefSeq may list the same gene ID more than once when the gene maps to multiple regions of the genome.
This case is different: I'm finding duplicates even for genes that map to _identical_ regions of the genome. For example, a query for chr1:77548961-77812107 on the most recent rat genome assembly, rnor5 (rn5), returns two results.
The two results, shown below, are exactly identical: same header, same range, same strand, same sequence. Why is this?
>rn5_refGene_NM_001025131 range=chr1:77548961-77812107 5'pad=0 3'pad=0 strand=- repeatMasking=none
tcatgaacttgggcgcacaaatgcagcagaatgttctttgtatgagttca
aattgctcaataaaataatccttgtctctgtgttacaatatttatttatt
cctatcagtagctagtttcacaagagactagagaatgttaatgatttttt
aattgcactataaatcttacctctcagcatttgctataaacagaacaggt
ATGATGTCAGACTATACTTGGTTTGAAGGAATACCTTTTCCTGCCTTTTG
GTTTTCCAAAGAAATTCTGGAAAATAGTTGTAAGAAGTTTGTGGTAAAAG
AAGACGACTTGATCATATTGACTTACCCCAAGTCAGGAACGAACTGGCTG
ATCGAGATTGTCTGCTTGATTCAGACCAAGGGAGATCCCAAGTGGATCCA
ATCTATGCCCATCTGGGATCGCTCACCCTGGATAGAGACTGGTTCAGGAT
ATGATAAATTAACCAAAATGGAAGGACCACGACTCATGACCTCCCATCTT
CCCATGCATCTTTTCTCCAAGTCTCTCTTCAGTTCCAAGGCCAAGGTGAT
ATATCTCATCAGAAATCCCAGAGATGTTCTTGTTTCTGCTTATTTTTTCT
GGAGTAAGATCGCCCTGGAGAAGAAACCAGACTCGCTGGGAACTTACGTT
GAATGGTTCCTCAAAGGAAATGTTGCATATGGATCATGGTTTGAGCACAT
CCGTGGCTGGCTGTCTATGAGAGAATGGGACAACTTCTTGGTACTGTACT
ATGAAGACATGAAAAAGGATACAATGGGATCCATAAAGAAGATATGTGAC
TTCCTGGGGAAAAAATTAGAGCCAGATGAGCTGAATTTGGTCCTCAAGTA
TAGTTCCTTCCAAGTCGTGAAAGAAAACAACATGTCCAATTATAGCCTCA
TGGAGAAGGAACTGATTCTTACTGGTTTTACTTTCATGAGAAAAGGCACA
ACTAATGACTGGAAGAATCACTTCACAGTAGCCCAAGCTGAAGCCTTTGA
TAAAGTGTTCCAGGAGAAAATGGCCGGTTTCCCTCCAGGGATGTTCCCAT
GGGAATAAattgtgaaagtaattttttaaaagatagtattattaatacta
gtagtatcagcggtggtggtggtggtggtggtggtggtggtggtggtggt
ggtggtgggtgtacctttcagtttgtagtgcctttagatgccagaacact
gcactggtactagggacctgcagtttcattaagctgtaaacttcttgctt
tgggtgct
>rn5_refGene_NM_001025131 range=chr1:77548961-77812107 5'pad=0 3'pad=0 strand=- repeatMasking=none
tcatgaacttgggcgcacaaatgcagcagaatgttctttgtatgagttca
aattgctcaataaaataatccttgtctctgtgttacaatatttatttatt
cctatcagtagctagtttcacaagagactagagaatgttaatgatttttt
aattgcactataaatcttacctctcagcatttgctataaacagaacaggt
ATGATGTCAGACTATACTTGGTTTGAAGGAATACCTTTTCCTGCCTTTTG
GTTTTCCAAAGAAATTCTGGAAAATAGTTGTAAGAAGTTTGTGGTAAAAG
AAGACGACTTGATCATATTGACTTACCCCAAGTCAGGAACGAACTGGCTG
ATCGAGATTGTCTGCTTGATTCAGACCAAGGGAGATCCCAAGTGGATCCA
ATCTATGCCCATCTGGGATCGCTCACCCTGGATAGAGACTGGTTCAGGAT
ATGATAAATTAACCAAAATGGAAGGACCACGACTCATGACCTCCCATCTT
CCCATGCATCTTTTCTCCAAGTCTCTCTTCAGTTCCAAGGCCAAGGTGAT
ATATCTCATCAGAAATCCCAGAGATGTTCTTGTTTCTGCTTATTTTTTCT
GGAGTAAGATCGCCCTGGAGAAGAAACCAGACTCGCTGGGAACTTACGTT
GAATGGTTCCTCAAAGGAAATGTTGCATATGGATCATGGTTTGAGCACAT
CCGTGGCTGGCTGTCTATGAGAGAATGGGACAACTTCTTGGTACTGTACT
ATGAAGACATGAAAAAGGATACAATGGGATCCATAAAGAAGATATGTGAC
TTCCTGGGGAAAAAATTAGAGCCAGATGAGCTGAATTTGGTCCTCAAGTA
TAGTTCCTTCCAAGTCGTGAAAGAAAACAACATGTCCAATTATAGCCTCA
TGGAGAAGGAACTGATTCTTACTGGTTTTACTTTCATGAGAAAAGGCACA
ACTAATGACTGGAAGAATCACTTCACAGTAGCCCAAGCTGAAGCCTTTGA
TAAAGTGTTCCAGGAGAAAATGGCCGGTTTCCCTCCAGGGATGTTCCCAT
GGGAATAAattgtgaaagtaattttttaaaagatagtattattaatacta
gtagtatcagcggtggtggtggtggtggtggtggtggtggtggtggtggt
ggtggtgggtgtacctttcagtttgtagtgcctttagatgccagaacact
gcactggtactagggacctgcagtttcattaagctgtaaacttcttgctt
tgggtgct
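To confirm that the two records really are identical rather than just similar, and to collapse them while waiting for an explanation, something like the following could be used. This is a minimal sketch, not part of the original query: the function names `read_fasta`, `find_duplicates` and `dedupe` are hypothetical, it uses only the Python standard library, and it treats two records as duplicates only when both the header line and the full sequence match exactly. It does not explain why two records are returned in the first place.

```python
from collections import Counter


def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new record starts; emit the previous one, if any.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def find_duplicates(records):
    """Return {(header, sequence): count} for records seen more than once."""
    counts = Counter(records)
    return {rec: n for rec, n in counts.items() if n > 1}


def dedupe(records):
    """Keep the first copy of each (header, sequence) pair, preserving order."""
    seen = set()
    unique = []
    for rec in records:
        if rec not in seen:
            seen.add(rec)
            unique.append(rec)
    return unique


if __name__ == "__main__":
    # Toy input standing in for the downloaded FASTA (hypothetical data).
    toy = [
        ">geneA range=chr1:1-8 strand=-",
        "acgt",
        "ACGT",
        ">geneA range=chr1:1-8 strand=-",
        "acgt",
        "ACGT",
        ">geneB range=chr1:9-12 strand=+",
        "tttt",
    ]
    records = list(read_fasta(toy))
    for (header, _seq), n in find_duplicates(records).items():
        print(f"{header}: {n} identical copies")
    print(len(dedupe(records)), "unique records")
```

For a real download, the list `toy` would be replaced by an open file handle, for example `with open("out.fa") as fh: records = list(read_fasta(fh))`, where `out.fa` is a placeholder file name. Comparing the sequence case-sensitively is deliberate here, since the lower/upper case in this output distinguishes UTR from coding sequence.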