Before I ask the question to the UCSC mailing list: is it me or something else ?
I've noticed that some records (not all) in hg18/UCSC ensGene.txt coding for a protein have a size where length%3!=0
For example for http://genome.ucsc.edu/cgi-bin/hgc?g=htcGeneMrna&i=ENST00000383614&c=chr6&l=30082338&r=30083996&o=ensGene&table=ensGene
>ENST00000383614
ccccagacgccgacgatggggtcATGGCGCCCCGAACCCTCCTCCTGCTG
CTCTCGGGGACCCTGGCCCTGGCCGAGACCTGGGCGGCCCCCCCCAAGAC
ACACGTGACCCacccccctctctgaacatgaggcataa
echo -n ATGGCGCCCCGAACCCTCCTCCTGCTGCTCTCGGGGACCCTGGCCCTGGCCGAGACCTGGGCGGCCCCCCCCAAGACACACGTGACCC | wc -c
88
but 88%3!=0
is it an error from the UCSC or am I missing something ?
this sequence you posted has a stop codon in position 87 of the nucleotide seq (84 starting counting from 0).
Could be an error from the UCSC: https://lists.soe.ucsc.edu/pipermail/genome/2010-May/022368.html
yes, that was an error "We have determined that the data as originally incorporated into the track was strangely annotated and Ensembl has since corrected the error. The track on our side will be updated (and this data corrected) at the next update"