Hello,
I would like to download all the reference sequences, full length for a given organism. I am using esearch as reported on the NCBIwebsitee with the following command:
esearch -db "nucleotide" -query "txidX[Organism] AND refseq[filter]"|efetch -format fasta > genome.fa
where X is the code for a given taxon. This works but I get both 'complete genome' and 'complete sequence' entries.
Is it possible to get only the 'complete genome' entries? Thank you
thank you, but the taxon I am looking for contains both complete genomes and sequences; still is there a way to separate them, either directly with an option of esearch or afterward with the manipulation of the resulting fasta file?