Hi,
I got partial protein sequences (one domain from bigger proteins) and I want to know the corresponding DNA sequence of that part of the protein. I know the proteins GI number so I can get the DNA origin of the complete protein sequence from NCBI.
So for example the partial sequence is:
GKYVRYTPEQVEALERLYHDCPKPSSIRRQQLIRECPILSNIEPKQIKVWFQNRRCREKQRKEASRLQAVNRKL
TAMNKLLMEENDRLQKQVSQLVH
The whole protein is:
MAMSCKDGKLGCLDNGKYVRYTPEQVEALERLYHDCPKPSSIRRQQLIRECPILSNIEPKQIKVWFQNRRCRE
KQRKEASRLQAVNRKLTAMNKLLMEENDRLQKQVSQLVHENSYFRQHTPNPSLPAKDTSCESVVTSGQHQLA
SQNPQRDASPAGLLSIAEETLAEFLSKATGTAVEWVQMPGMKPGPDSIGIIAISHGCTGVAARACGLVGLEPTR
VAEIVKDRPSWFRECRAVEVMNVLPTANGGTVELLYMQLYA
And the dna sequence on NCBI is (cut both protein and dna off so they aren't exactly the same as on ncbi):
ACATCTCTTCTTCATCCTCTCTTCTACTTTCCTCTTTCCTCTTTCCTTCTTCGAATAAATTTCTAGGGTT
TTTCTTTTCTCTAAAGTTTTCATTTTTATTTCAATGAGAGCTCGAAGAAGGAGAATATGGGTTTGAGAAC
TGATAATATTATGGCTTCGTTTCGAGGTGGAATCGGGGTTTCTAATGGCTGAGTCAACTCGGTGATTCTG
TGTTATAGTCACGAGCAAATATAAAAAAGTTTGTAACTTTCTTGTTTTTTTAGGTGTGTGTGTTCAGAGA
AAAGGTCGAATCTTTTTTCGGTGTTTGTAAAAGGGAAAGTTGTAATCTTAAAGTCTGTTTTTCTTTCTTG
TGTTTTGGTATTTAGCTCATAAAAGCCGAGGAGTAATATAAAGGATAGGTTTTGTCTTTGTGTGCCCTTT
TGAGATTGCATGAAGAAAAAAAGCCTCTAGTGTGTTTTGAAGGAAACAGAATTCGATATTTATGCGGTAA
TGTGATTTGTGAAGCTACTCCAAGTGCTTAGGATTTGAGATGGCTTAGATTTGGTAGTTGTTCAAGCTGT
GGAGTTTGTGGTGGACTAAGAAGCTCTCTGTCTCCTTTGTTTAGTATGTTGTGGTTATCTTCTGTTTAGA
AGGATTTAGTTATTCATCTGGAGGGGGTAGTAGGGTCATTTGTGAGATTCTGTGATTGTGAAATAAGAAG
AGTTTTGCTGAGGAGTAATGGCAATGTCTTGCAAGGATGGTAAGTTGGGATGTTTGGATAATGGGAAGTA
TGTGAGGTATACACCTGAACAAGTTGAAGCACTTGAGAGGCTTTATCATGACTGTCCTAAACCGAGTTCT
ATTCGCCGTCAGCAGTTGATCAGAGAGTGTCCTATTCTCTCTAACATTGAGCCTAAACAGATCAAAGTGT
GGTTTCAGAACCGAAGATGTAGAGAGAAACAAAGGAAAGAGGCTTCACGGCTTCAAGCTGTGAATCGGAA
GTTGACGGCAATGAACAAGCTCTTGATGGAGGAGAATGACAGGTTGCAGAAGCAAGTGTCACAGCTGGTC
CATGAAAACAGCTACTTCCGTCAACATACTCCAAATCCTTCACTCCCAGCTAAAGACACAAGCTGTGAAT
CGGTGGTGACGAGTGGTCAGCACCAATTGGCATCTCAAAATCCTCAGAGAGATGCTAGTCCTGCAGGACT
TTTGTCCATTGCAGAAGAAACTTTAGCAGAGTTTCTTTCAAAGGCAACTGGAACCGCTGTTGAGTGGGTT
CAGATGCCTGGAATGAAGCCTGGTCCGGATTCCATTGGAATCATCGCTATTTC
And I want to know what the DNA sequence is for the partial protein sequence.
I think that know how I could program this but I was wondering if you know an existing script or program that already does that?
Thanks, Niek
Do you know if bl2seq has a different name in blast 2.2.24+, or if they removed it?
I'm yet to upgrade to 2.2.24, so I don't know for sure. As far as I know it is still in the package. If not, the older versions are still available to download.