I have a list of gi corresponding to the nucleotide sequence in NCBI. I need to compile a file of cds sequences of those gi. I attempted to use batch entrez but it yielded whole genome sequences.
Can anyone suggest a method to retrieve cds sequences from NCBI given I have a list of corresponding gi
You can try the e-utilities, for example e-fetch.
A greedy solution could be to get the genbank file corresponding to your gi with efetch with get, something like that in Perl:
Then parse the genbank with BioPerl:
If you only need the CDS it may be a little bit to much. But if you are going to need other information from the genbank file it could be useful.
There is probably a simpler and more elegant solution.