Hello.
I've recently been dealing with a fundamental problem in biology: mapping from a protein back to DNA. I realize this is currently not precisely possible, yet I still seem to be missing some vital information on the subject.
If I understand correctly, for annotated exons a codon-to-protein mapping should be available, i.e. a mapping from genomic locations to protein sequences. Where can one find this type of information?
Thanks for any clarification.
I reformulated the question, sorry for the confusion.
You can download everything you need from Ensembl. The gene locations are in the GTF file. The protein sequences are in the "protein sequence (fasta)" files.
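For what it's worth, here is a minimal sketch of how the CDS records in a GTF file could be turned into a per-transcript map from genomic position to residue number. It is only an illustration under some assumptions: the function names `parse_cds` and `position_map` are made up for this example, the toy records and the IDs `T1`/`T2` are hypothetical, and it assumes each transcript has a complete CDS that starts at phase 0 (the phase column is ignored). Coordinates are taken as 1-based and inclusive, as in GTF.

```python
import re
from collections import defaultdict


def parse_cds(gtf_lines):
    """Collect CDS intervals per transcript_id from GTF-formatted lines."""
    cds = defaultdict(list)
    for line in gtf_lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and header comments
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 9 or fields[2] != "CDS":
            continue  # only CDS features carry coding coordinates
        match = re.search(r'transcript_id "([^"]+)"', fields[8])
        if match:
            # (chromosome, start, end, strand); GTF is 1-based, inclusive
            cds[match.group(1)].append(
                (fields[0], int(fields[3]), int(fields[4]), fields[6])
            )
    return cds


def position_map(intervals):
    """Map (chrom, pos) -> (residue number, position in codon), both 1-based.

    Assumption: the CDS is complete and starts at phase 0.
    """
    strand = intervals[0][3]
    # Walk the segments in transcription order: ascending on '+', descending on '-'
    ordered = sorted(intervals, key=lambda iv: iv[1], reverse=(strand == "-"))
    mapping = {}
    offset = 0  # 0-based index of the base within the spliced CDS
    for chrom, start, end, _ in ordered:
        if strand == "+":
            positions = range(start, end + 1)
        else:
            positions = range(end, start - 1, -1)
        for pos in positions:
            mapping[(chrom, pos)] = (offset // 3 + 1, offset % 3 + 1)
            offset += 1
    return mapping


# Toy records (hypothetical IDs): two transcripts overlapping at positions 103-104
toy_gtf = [
    '1\ttoy\tCDS\t101\t104\t.\t+\t0\tgene_id "G1"; transcript_id "T1";',
    '1\ttoy\tCDS\t201\t205\t.\t+\t2\tgene_id "G1"; transcript_id "T1";',
    '1\ttoy\tCDS\t103\t108\t.\t+\t0\tgene_id "G1"; transcript_id "T2";',
]
maps = {tid: position_map(ivs) for tid, ivs in parse_cds(toy_gtf).items()}
```

In this toy data, position 103 is the third base of codon 1 in `T1` but the first base of codon 1 in `T2`, so the map has to be kept per transcript. The residue numbers can then be used to index into the matching record of the protein FASTA file.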
Aaah. I missed the fact that the fasta sequences also include genome regions. This explains how to get from the gene sequence to the protein sequence, yet the exact nucleotide-to-amino-acid mapping (a one-to-one mapping from nucleotide to AA) is not available, if I am not mistaken?
There isn't always a 1:1 mapping, given overlapping transcripts, so no, you won't find anything like this anywhere:
Thank you, this clarifies it for me.