HI
I have more than 10,000 protein IDS, I'm interested in extracting all the fasta sequences of these proteins ids from uniprot.
What I did so, far-- Already I downloaded all the fasta sequences of the organism I'm interested in.
How can I do, need suggestions.
Use
blast+
preformattednr
database along withblastdbcmd
utility. Use-entry_batch
option to do a large number of accessions.An example for a single accession below.
Moving this to a comment since
nr
may not contain all UniProt ID's and if that is all you have then this would not be sufficient.