Question

How is it possible that multiple proteins maps to single transcript ids ?

0

Entering edit mode

7.7 years ago

jack ▴ 990

Hi all,

I've downloaded transcript mapping to PDB annotation from GENECODE

When I look at the table, I see that one transcript id maps to multiple proteins. I was pondering about the reason behind of it. Here are some examples:

                  V1   V2
1  ENST00000567584.1 1UJU
2  ENST00000567584.1 1WHA
3  ENST00000567584.1 1X5Q
4  ENST00000567584.1 2W4F
5  ENST00000569114.1 1UJU
6  ENST00000569114.1 1WHA
7  ENST00000569114.1 1X5Q
8  ENST00000569114.1 2W4F
9  ENST00000564122.1 1UJU
10 ENST00000564122.1 1WHA

protein RNA-Seq sequencing ChIP-Seq • 1.5k views

ADD COMMENT • link updated 2.2 years ago by Ram 45k • written 7.7 years ago by jack ▴ 990

score 0 · Answer 1 · 2017-11-28

0

Entering edit mode

7.7 years ago

Hussain Ather ▴ 990

A single transcript can have multiple ORFs that can give rise to multiple protein sequences.

ADD COMMENT • link 7.7 years ago by Hussain Ather ▴ 990

score 0 · Answer 2 · 2017-11-28

0

Entering edit mode

7.7 years ago

Istvan Albert 103k

It helps to look up the definitions for each:

https://www.ensembl.org/Homo_sapiens/Transcript/Idhistory?t=ENST00000569114

https://www.ensembl.org/Homo_sapiens/Transcript/Idhistory?t=ENST00000567584

Note how in this case both are marked as "retired".

In summary the data mapping stores all mappings, even historical ones that are now obsolete.

ADD COMMENT • link 7.7 years ago by Istvan Albert 103k

0

Entering edit mode

This problem should be addressed in next Ensembl release: C: Ensembl ID converter not working?

ADD REPLY • link 7.7 years ago by GenoMax 153k