Hello,
I'm using the Ensembl Perl API provided by Ensembl in order to collect information about a set of genes (ENST IDs), all the transcripts from those genes (ENSG IDs), and all the variants from those transcripts (rs/cos/... IDs).
I'm actually looking for an efficient way to get the protein sequences translated from each transcript variant (variations included), i.e. one protein sequence per transcript variation.
Being unable to directly get the whole protein sequence neither the nucleotide sequence of a transcript variation in order to translate it (maybe I missed something), I had the idea of using a Slice object between the start and end positions of each transcript then translate the returned sequences based on the location and type of each transcript variation. But I realized it was tedious to handle frame-shift variants and some variation cases.
That's the reason why I'm looking for a more direct and efficient way to get those protein sequences.
If someone has the answer or any suggestion, I would greatly appreciate it.
Regards,
Nathan
You should post this as a new question in a separate thread.
I think I'm giving my solution to the original question. I'm just advising that my answer could be improved by other user and I'm expecting for that. I'm new in biostar so sorry if I've made a mistake.
Cheers,
Fran.
I did not realize that you were actually offering a solution since the sentence at the beginning of your post is unclear. My apologies.