Dear, all,
The de novo transcriptome gave me the following amino acid sequences. Many of the amino acid sequences start with methionine, but some of them do not start with methionine like this.
What is this protein?
>At_DN540_c0_g1_i10.p1 AEDLRETAGEYADSAKEKAENLKNRAKEHLPDSDTLSKTKESLKKKASDGKEYVQEKAEDLRETAGEYADSAKEKAENLKNRAKEHLPDSDTLSKTKESLKKKASDGKEYVQEKAEDLRETAGEYADSAKEKAENLKNRAKEQAENLKDNVKDKYDEYTDSAKEYANDAKDKVKSKTEEYTGSAKKTANDVKDKVKSKANDLKEGAQGMSESVSDSAREGFHHVEDKAEDIKEAGEQKKEEIDAKAEEAKSTIGGKIKSAADTVVGGVKSAAESVGSTVSGVFSSASDKADQAKEDVKEKAEEKREEIKESVRRKRETVGEKIDSNIEDAKSKASDAKAAAGEKIDEAKEKLNRFRRHTSAEEAGEKAGSTIDRAREKASEAGQAVGDKAKELKDDVTNRMKRAENDTVSGGGSKIGEGLAEIGGAAKTGAANAGATVVGGVIFAGEKVGEGAKAAKDKTVETAQAVGDKASEAAEAARQKADDAKNSFGKTVSFNTRSDTSIQNLGNL*
Thank you for reading.
Hi.
When I was evaluating coding sequence from ensembl transcripts, I found some transcript coding sequence do not start with ATG. They start with GTC, CAC, GGC... etc. The peptide sequence sometimes start with X, H... etc.
Some examples:
ENST00000466610
(start with H),ENST00000680216
andENST00000704018
(start with X) ,ENST00000685033
(start with G).I can't find a reason and these transcripts are not using rare start codon either. Would you mind explaining a little bit more? Thank you.