Downloading unique data from refseq
0
0
Entering edit mode
8.7 years ago

Hi

I would like to download all the bacterial files from the genome database bearing .fna.gz extension in one go. The problem is that there are around 10000 bacterial entries in the database and each filename is unique. Any suggestions.

fasta bacteria refseq • 1.9k views
ADD COMMENT
0
Entering edit mode

I would suggest that it would be a good idea to use the search.

ADD REPLY
0
Entering edit mode

@avinashdhar123: When you use this method to download the data make a note that NCBI has moved the bacterial genomes to this directory: ftp://ftp.ncbi.nih.gov/genomes/refseq/bacteria/

ADD REPLY
0
Entering edit mode

Thank you for the reply. Command I used is:

wget -cNrv -t 45 -A ".fna" "ftp://ftp.ncbi.nih.gov/genomes/refseq/bacteria/"

After running this command only folders are downloading not the files inside it.

Any suggestions.

ADD REPLY
0
Entering edit mode

NCBI is not very easily organized now.

See the post C: where can I get environmental bacteria genome in fasta format (as many as possib,

and my answer inside it, it may help you.

ftp://ftp.ncbi.nlm.nih.gov/genomes/archive/old_refseq/Bacteria/

you will find the file you need in the list above. It is not updated,

recently sequenced bacteria are absent there.

ADD REPLY

Login before adding your answer.

Traffic: 2445 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6