Entering edit mode
6.6 years ago
ThePresident
▴
180
I have a list of protein accession identifiers such as "CBE06962.1". I would like to automatically extract several features such as locus_tag, start and stop positions of the corresponding genes, UniProt tags etc. Is it possible to do it with by combining esearch
and efetch
from e-utilities
, something like:
esearch -db protein -query "CBE06962.1" | efetch ???
If not, I am thinking of downloading all gbanks files and then parsing it for the info I want.
Any suggestions? Thanks in advance,
TP
It's a good start, I can at least have the start/stop positions, the strand and nucleotide accession. One thing that would be really useful is a locus_tag.
Thanks, this is still pretty good though.
TP
A single clean solution may be possible but at least this will get you started
Since this brings back a GenPept format record you can grep for several other pieces of information.
Many thanks, this looks pretty good.