Hi everybody!
I've run into a problem with the CDS features of adenovirus genes. I want to analyse the G/C ratio and codon usage of these genes, but the CDS locations of some of them contain a less-than sign (e.g. CDS <11565..12297 in http://www.ebi.ac.uk/ena/data/view/M73260&display=text ). One strange thing is that the CDS length (12297 - 11565 + 1 = 733) isn't divisible by three. What does the less-than sign in a CDS location mean?
Take a look at the section on features in the example GenBank record page here. Here is the relevant extract:
If the "<" symbol precedes a base span, the sequence is partial on the
5' end (e.g., CDS <1..206). If the ">" symbol follows a base span,
the sequence is partial on the 3' end (e.g., CDS 435..915>).
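To make the notation concrete, here is a minimal Python sketch that reads a simple "start..end" location and reports which ends are partial. The function name `parse_span` is hypothetical, and the sketch assumes a plain span on the forward strand (no `join(...)` or `complement(...)`). It accepts the ">" either before or after the end coordinate, since both spellings turn up.

```python
import re

def parse_span(location):
    """Parse a simple 'start..end' feature location, noting partial ends."""
    # Optional '<' before the start; optional '>' before or after the end.
    m = re.fullmatch(r"(<?)(\d+)\.\.(>?)(\d+)(>?)", location.strip())
    if m is None:
        raise ValueError(f"unsupported location: {location!r}")
    start, end = int(m.group(2)), int(m.group(4))
    return {
        "start": start,
        "end": end,
        "partial_5prime": m.group(1) == "<",
        "partial_3prime": ">" in (m.group(3), m.group(5)),
        "length": end - start + 1,  # coordinates are 1-based and inclusive
    }

span = parse_span("<11565..12297")
print(span["partial_5prime"], span["length"], span["length"] % 3)
```

For the span in the question this gives a length of 733 with a remainder of 1 when divided by three. That is expected for a 5'-partial CDS: the annotated region need not begin on a codon boundary. In the feature table format, the reading frame of such a feature is normally given by its /codon_start qualifier (1, 2 or 3), so it is worth checking whether the record provides one.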
Thanks, your reply is really helpful. What would you suggest for computing the G/C content and codon usage of genes like these with a partial CDS?
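One possible approach, sketched in Python under some assumptions: the frame of the partial CDS is known (for example from a /codon_start qualifier on the feature), the CDS is on the forward strand, and the sequence of the span has already been extracted. The function names and the toy sequence below are hypothetical and only for illustration. G/C content does not depend on the frame, so it can be computed over the whole fragment; codon usage should only count complete in-frame codons.

```python
from collections import Counter

def in_frame_codons(cds, codon_start=1):
    """Split a (possibly partial) CDS into complete in-frame codons.

    codon_start is 1, 2 or 3: the 1-based offset of the first base of the
    first complete codon. Leftover bases at the 3' end are dropped.
    """
    if codon_start not in (1, 2, 3):
        raise ValueError("codon_start must be 1, 2 or 3")
    seq = cds.upper()[codon_start - 1:]
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]

def gc_content(seq):
    """Fraction of G and C among unambiguous bases (A, C, G, T)."""
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    return (seq.count("G") + seq.count("C")) / acgt if acgt else 0.0

def codon_usage(cds, codon_start=1):
    """Count complete codons, skipping any that contain ambiguous bases."""
    codons = in_frame_codons(cds, codon_start)
    return Counter(c for c in codons if set(c) <= set("ACGT"))

# Toy fragment (made up): one stray base at the 5' end, so codon_start=2.
fragment = "GATGGCCAAATGA"
print(in_frame_codons(fragment, codon_start=2))
print(gc_content(fragment))
print(codon_usage(fragment, codon_start=2))
```

If the record gives no frame information for a partial CDS, the codon counts for that gene are unreliable, and it may be safer to leave it out of the codon usage analysis while still including it in the G/C calculation.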