I am trying to identity all the amino acids in each protein in the human genome in which the residues are split between two adjacent exons.
To illustrate the problem, suppose I have the following peptide PNKCSGMRFP
Suppose that the residues PNKCS were encoded by exon 2 but the codons encoding "G" amino acid (which is encoded by the GGA codon) was split either as G|GA (the G on the left side of "|" is encoded by exon 2 and the "GA" on the right side of "|" is encoded by exon 3) or as GG|A.
Thank you
Lee