How to download all the complete genomes of a bacterium (eg E. coli, Mycobacterium tuberculosis etc) in the .gbk format from NCBI?
2
0
Entering edit mode
6.4 years ago
savscosta • 0

I tried downloading the complete genomes of several bacterial from the NCBI site but i didnt succeed

genome • 2.7k views
ADD COMMENT
0
Entering edit mode
ADD REPLY
2
Entering edit mode
6.4 years ago
Joe 21k

Use Kai Blin's ncbi-genome-download tool. You can pass it a list of the Taxa IDs you want.

https://github.com/kblin/ncbi-genome-download

If you don't know which taxids you need, use the script I wrote which is in the same repository as the tool under the /contrib/ folder

ADD COMMENT
1
Entering edit mode
6.4 years ago
natasha.sernova ★ 4.0k

See my answer in this post:

where can I get environmental bacteria genome in fasta format (as many as possible)?

the .gbk format doesn't exist in NCBI now, you need to transform *.gbff-file to gbk or use another database.

In case you don't need some recent genome versions, search bacterial genomes you need in the file inside the post above.

I've repeated the link below. And these genomes are in gbk-format.

ftp://ftp.ncbi.nlm.nih.gov/genomes/genbank/assembly_summary_genbank.txt

Copy the ftp-links you need and download the corresponding genomes, one after another.

how to download all the complete genomes for mycobacteria from NCBI? - this 5heikki's post gives a computational way to do it.

ADD COMMENT

Login before adding your answer.

Traffic: 1526 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6