Retrieving the corresponding Haplotype CDS from Ensembl
3.4 years ago
Joseph Hughes ★ 3.0k

Given an Ensembl protein identifier and amino acid substitutions such as ENSP00000242351:701Q>E,851T>I, how do I programmatically retrieve and download the corresponding coding sequence (CDS) haplotype with the largest observed count?

Screenshot of the corresponding Haplotype CDS

I need to do this for a batch of different protein/haplotype combinations, so I would like to use the REST API.

gene Ensembl protein Haplotype • 963 views
3.4 years ago
Emily 24k

This REST API endpoint (transcript_haplotypes) gets the haplotypes per transcript. Each protein haplotype stores its associated CDS haplotypes as hexes, which you can use to link it to the matching entries in the CDS haplotypes.
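For a batch job, the request itself could be sketched roughly as below. This is only an illustration, not an official client: it assumes the public server at https://rest.ensembl.org, the transcript_haplotypes/:species/:id route, and that a `sequence=1` parameter asks for the haplotype sequences to be included. The function names are made up for this example. Note that the route takes a transcript (ENST) ID, so the ENSP ID would first need mapping to its transcript.

```python
import json
import urllib.parse
import urllib.request

# Assumed public Ensembl REST server.
SERVER = "https://rest.ensembl.org"


def haplotype_url(transcript_id, species="homo_sapiens", with_sequence=True):
    """Build the transcript_haplotypes URL for one transcript (ENST) ID."""
    path = "/transcript_haplotypes/{}/{}".format(
        urllib.parse.quote(species), urllib.parse.quote(transcript_id)
    )
    # Assumption: 'sequence=1' requests the haplotype sequences in the response.
    query = urllib.parse.urlencode({"sequence": 1}) if with_sequence else ""
    return SERVER + path + ("?" + query if query else "")


def fetch_haplotypes(transcript_id, species="homo_sapiens"):
    """Download and parse the haplotype JSON for one transcript."""
    request = urllib.request.Request(
        haplotype_url(transcript_id, species),
        headers={"Content-Type": "application/json"},  # ask for JSON output
    )
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.load(response)
```

Calling `fetch_haplotypes` once per transcript and caching the parsed JSON should be enough, since a single response covers all haplotypes of that transcript.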


So would this be taking the hex for the ENSP00000242351:701Q>E,851T>I in 'protein_haplotypes' and finding it as 'other_hex' in the 'cds_haplotypes'?


Yes, that's it. Or you can go the other way: get the other_hex from the protein_haplotype and find the cds_haplotype for which it is the main hex.
