I have been trying to retrieve the CDS start and end positions and the gene length for about 1000 protein accessions (e.g., ALD89117.1, ALD89128.1, ALD89126.1, ANR02692.1, AVA17449.1) that I have as input. Would someone be kind enough to share code or expertise on how to get this done? Thanks.
@Sej provided an answer this morning: C: download genbank sequences with exon sequences highlighted
Modify as necessary. She is referring to the NCBI Unix utilities.
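If it helps, here is a rough sketch of the parsing step in Python. It assumes you have already fetched the GenPept record for each protein (for example with `efetch -db protein -format gp`) and pulled out the `/coded_by` qualifier of its CDS feature, which gives the nucleotide accession and coordinates. The function name `parse_coded_by` and the accession `XX000000.1` are made up for illustration, and the sketch only handles simple `start..end` segments, optionally wrapped in `join(...)` or `complement(...)`.

```python
import re

# One segment looks like ACCESSION.VERSION:START..END.
# '<' and '>' mark partial ends; the accession prefix is treated as optional.
SEGMENT = re.compile(r"(?:([A-Za-z0-9_.]+):)?<?(\d+)\.\.>?(\d+)")


def parse_coded_by(coded_by):
    """Parse a GenPept /coded_by string.

    Returns (nucleotide_accession, start, end, strand, cds_length), where
    start/end are the outermost coordinates on the nucleotide record and
    cds_length is the summed length of all segments in nucleotides.
    """
    segments = SEGMENT.findall(coded_by)
    if not segments:
        raise ValueError("no start..end segment found in: " + coded_by)

    strand = "-" if "complement(" in coded_by else "+"
    accession = segments[0][0] or None  # None if no accession prefix was given
    starts = [int(start) for _, start, _ in segments]
    ends = [int(end) for _, _, end in segments]
    cds_length = sum(end - start + 1 for start, end in zip(starts, ends))
    return accession, min(starts), max(ends), strand, cds_length


if __name__ == "__main__":
    # Hypothetical coded_by values, not taken from real records.
    examples = [
        "XX000000.1:12..1234",
        "complement(XX000000.1:<12..>1234)",
        "join(XX000000.1:1..10,XX000000.1:21..30)",
    ]
    for value in examples:
        print(value, parse_coded_by(value), sep="\t")
```

Note that `end - start + 1` gives the span on the nucleotide record, while `cds_length` excludes any gaps between joined segments, so pick whichever matches what you mean by gene length.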