maize ESTs NCBI
1
0
Entering edit mode
10.0 years ago
dinesh ▴ 50

My question looks simple but I'm new to bioinformatics. I have a task of downloading maize ests which are around 20 lakhs. So, how can I download this ESTs in batches?

For example: 50000 ESTs in one file and next 40000 ESTs in one file like wise.

Thanks in advance

NCBI • 2.1k views
ADD COMMENT
0
Entering edit mode
10.0 years ago

Go to http://www.ncbi.nlm.nih.gov/nucest?term=%22zea%20mays%22[Organism]

Select 'send to -> file -> fasta ' & download

Use awk to produce sequences with two lines (name+header) and split

awk '/^>/ {if(NR>1) printf("\n"); print;next;} {printf("%s",$0);}' your.fasta  |  split -l 100000 - prefix
ADD COMMENT

Login before adding your answer.

Traffic: 2634 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6