Dear all,
I've recently resurrected a bioinformatics pipeline I put together a few years ago to load bacterial genome GenBank files into a MySQL database.
Having downloaded *.gbk from the genomes/Bacteria directory on ftp.ncbi.nih.gov, however, I get an error with many of the files:
ValueError: Expected CONTIG continuation line, got:
ORIGIN
It seems that there's an additional CONTIG field immediately before the nucleotide sequence (for example, this occurs with NC_019435.gbk).
Can anyone shed any light on the reason for this error now arising? I don't have any 'old versions' of these files around, but assume that the file format has been modified by GenBank?
many thanks,
Morgan
Standard Genbank file is an oxymoron, there might be an standard but people do what they please with it, it part of its definition.
Yeah, but it is an NCBI format so what they do is usually the standard ;)