Entering edit mode
4.5 years ago
i.sudbery
20k
Do protein coding genes in Ensembl/Gencode GTF files always only have one CDS? For example if a protein has a uORF that has been demonstrated to be translated, will it be annotated as a second CDS in the transcript, or can I always assume that the 5' most start codon in a GTF record is the start of the main CDS?