Not sure if this is the right forum for this.
Basically, I'm having some issues trying to find formatting guidelines for genbank. I'm trying to build a parser for genbank files, and looking at the lambda virus genome here there is a loc string complement(<23231)
.
What exactly does this mean? Does it refer to the complement between coordinate 1 and 23231? Also, is the reverse possible? i.e. complement(>23231)
.
NCBI Genbak uses flat file format. Details can be found in the following link. Also, check the biopython GenBank module.
https://www.ncbi.nlm.nih.gov/Sitemap/samplerecord.html