How to get list of GenBank accession Ids by a Taxonomy Id?
1
0
Entering edit mode
7.7 years ago
Charles Yin ▴ 180

In NCBI genbank, one Taxonomy Id corresponds a list of GenBank accession Ids, how do I retrieve these geneBank Ids by Python per a given Taxonomy Id? Thank you in advance.

sequence Genbank • 2.5k views
ADD COMMENT
0
Entering edit mode

Thanks! But what is the file? I tried to download that file, it is about 1GB. I wish to use Biopython to retrieve GeneBank Ids from a given Taxonomy Id automatically. Is that possible?

ADD REPLY
0
Entering edit mode

If you must use biopython then see this: Retrieve Genbanks From Taxid

BTW: Use ADD REPLY/ADD COMMENT when responding to existing posts to keep threads logically organized.

ADD REPLY
0
Entering edit mode
7.7 years ago
GenoMax 147k

Get this file (it is large), uncompress and parse out the ID's you need.

For example for taxID 964: awk '{FS="\t";OFS="\t"}{if ($1 == 964) print $1,$6,$8;}' gene2accession

ADD COMMENT

Login before adding your answer.

Traffic: 1886 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6