4.7 years ago
dimitrischat
Hello,
I ran a blastp search on the longest ORFs against a UniProt database. Now I have an Excel file with protein IDs such as Q9LIK7, Q9FQ03, etc., and I would like to get the amino acid sequence for all of them in FASTA format. I can retrieve them one at a time, but doing that for every ID takes too long. Is there an easier way?
You can use the UniProt API to download the sequences. Examples are shown here.
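As a rough illustration of the API route, here is a minimal Python sketch that fetches one FASTA record per accession. It assumes the UniProt REST service serves entries at `https://rest.uniprot.org/uniprotkb/<accession>.fasta`; that URL pattern, the helper names `fasta_url` and `fetch_fasta`, and the file names in the usage comment are my assumptions, not something from the linked examples, so check the current UniProt documentation before relying on it.

```python
import urllib.request

# Assumed URL pattern for the UniProt REST service; verify against the current docs.
BASE_URL = "https://rest.uniprot.org/uniprotkb"


def fasta_url(accession):
    """Build the (assumed) FASTA download URL for a single accession."""
    return f"{BASE_URL}/{accession.strip()}.fasta"


def fetch_fasta(accessions, opener=urllib.request.urlopen):
    """Download each accession in turn and return the concatenated FASTA text.

    `opener` is injectable so the function can be exercised without network access.
    Blank entries (e.g. empty spreadsheet cells) are skipped.
    """
    records = []
    for acc in accessions:
        if not acc.strip():
            continue
        with opener(fasta_url(acc)) as response:
            records.append(response.read().decode("utf-8"))
    return "".join(records)


# Hypothetical usage: ids.txt holds one accession per line, copied from the Excel column.
# with open("ids.txt") as handle:
#     fasta_text = fetch_fasta(handle.read().splitlines())
# with open("hits.fasta", "w") as out:
#     out.write(fasta_text)
```

This makes one request per ID, which is fine for a few hundred accessions. For much longer lists, the bulk download described next is probably kinder to the server.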
You could also download the FASTA-formatted UniProt data from their FTP site, then use a program like faSomeRecords from Jim Kent's UCSC utils to extract the sequences you need.
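If you would rather not install an extra tool, the same kind of extraction can be sketched in a few lines of Python. This is a stand-in for the idea, not a reimplementation of faSomeRecords. It assumes the downloaded file uses UniProt-style headers of the form `>sp|Q9LIK7|ENTRY_NAME ...`, with the accession as the second `|`-separated field; the function names and file names are hypothetical.

```python
def accession_of(header):
    """Extract the accession from a FASTA header line.

    Assumes UniProt-style headers (>db|ACCESSION|ENTRY_NAME ...); for any other
    header the first whitespace-delimited word after '>' is returned instead.
    """
    fields = header[1:].split()
    name = fields[0] if fields else ""
    parts = name.split("|")
    return parts[1] if len(parts) >= 3 else name


def filter_fasta(lines, wanted):
    """Yield only the lines of records whose accession is in `wanted`."""
    keep = False
    for line in lines:
        if line.startswith(">"):
            # A new record starts here; decide whether to keep it.
            keep = accession_of(line) in wanted
        if keep:
            yield line


# Hypothetical usage, streaming so a large database file never has to fit in memory:
# with open("ids.txt") as handle:
#     wanted = {line.strip() for line in handle if line.strip()}
# with open("uniprot_sprot.fasta") as src, open("hits.fasta", "w") as out:
#     out.writelines(filter_fasta(src, wanted))
```

Keeping the IDs in a set makes each lookup constant-time, so the run time is dominated by a single pass over the database file.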