What is a gene transcript?
4
10
Entering edit mode
7.7 years ago
Dave ▴ 100

Hi,

I've recently started using Ensembl for help in designing a gene panel for NGS.

Each time I select a gene on Ensembl it comes up with different available transcripts for that gene.

What defines a transcript? Is it literally just a different version of the gene in different individuals (but if so, how come some of the transcripts appear to be so different in terms of bp length and amino acid number)

Any help in explaining would be most gratefully received!

sequencing ensembl gene transcript • 37k views
ADD COMMENT
0
Entering edit mode
ADD REPLY
0
Entering edit mode

Very interesting discussion, thank you very much for your knowledge. But I want to ask, if indeed a gene is actually a certain sequence of nucleotides or nitrogen bases in DNA, how do we know that this sequence is this gene, that sequence is that gene? And does anyone want to explain to me, b3a2 or b2a2 on the bcr abl gene fusion, is it true if I say it is the result of a transcript of the fusion of the gene or the truth is that it is a fusion of the gene (because genes consist of exons and introns)?

ADD REPLY
9
Entering edit mode
7.7 years ago

This is more of a comment, possibly a rant. In my opinion the concept of gene is obsolete and we would be better off if we ditched the concept of "gene" altogether.

Genes made sense when it was thought that there were these discrete units (genes) which produced each a single transcript and a single protein. However, a single gene can produce multiple transcripts and these can be very different one from the other (see for example the table of transcripts of the Actin gene, there are several coding and non coding transcripts). So when we say gene X has a mutation it is not clear what we are referring to. How many transcripts are affected by this mutation? Are they coding or pseudogenes? In my opinion it would be simpler to think in "transcript space" and forget about genes.

Another way if seeing this is to consider that while transcripts exist, genes don't exist. When you do mRNA extraction you isolate transcripts, not genes, you can't isolate genes. You could have a restriction enzyme that cuts left and right of a "gene", but this is also not true. You have an enzyme that cuts at positions A and B, which happen to include a bunch of transcripts. If the definition of that gene (and transcripts) change, the enzyme still cuts there because it doesn't "see" genes, it sees DNA sequence.

I think "genes" hang around because they make some statements simpler ("Mutation at position A hits gene X" as opposed to "Mutation at position A hits an intron of transcripts X, an exon of transcript Y, and a UTR of Z"), but they are at best incomplete statements.

Any thoughts?

ADD COMMENT
7
Entering edit mode

The original definition of a gene comes from the field of classical genetics where it is the unit of inheritance of a phenotypic trait. So any statement about mutation in gene X is actually perfectly clear from that point of view. The problem is to link this definition to its molecular underpinning. It seems different people do it differently. I like the definition used by EnsEMBL that a gene is a locus that produces a set of similar and functionally-related transcripts. I think the notion of gene is still useful despite its fuzziness not least because it is a convenient level of data integration as long as one is careful to not mix definitions.

ADD REPLY
0
Entering edit mode

I also like the EnsEMBL definition you quote. Do you have a link for the original definition ? I couldn't find it after 5 min of googling.

ADD REPLY
2
Entering edit mode

It's in the section describing the genome annotation pipeline although it is not exactly stated like I did. I first got this definition from a discussion with someone at the EBI some years ago but this is also apparent from the way the EnsEMBL data is structured. If you're interested in digging into this issue, you may also want to read this paper which reviews the gene definition history and concludes by suggesting a new definition:

The gene is a union of genomic sequences encoding a coherent set of potentially overlapping functional products.

which to me is another way of stating the definition I mentioned above.

ADD REPLY
8
Entering edit mode
7.7 years ago

A single gene can produce multiple different RNAs (i.e., transcripts). The actual transcript observed will depend on the tissue, developmental time point, and environmental/hormonal/etc. factors. Typically there's a single major transcript expressed in a given cell at a given time, but not always. For further details check out wikipedia.

ADD COMMENT
0
Entering edit mode

Thanks for the reply Devon!

Just to confirm that i have understood...: the sequence i am reading when i pick a transcript is not in fact the genomic sequence from the reference genome, but represents the RNA transcript from that gene? if this is the case then why do the transcripts include the introns also, is it supposed to reflect pre-splicing?

ADD REPLY
0
Entering edit mode

The transcript sequence usually doesn't include intronic sequence (though an intron in one transcript can be an exon in another). If it does then it's pre-mRNA.

ADD REPLY
5
Entering edit mode
7.7 years ago
mirza ▴ 180

To understand this in general terms, information in DNA is encoded into RNA which is also called as transcript (as genomic information is "transcribed" into this) which directs the protein formation. "Gene" is a genomic sequence that consists of 2 types of regions- sequences that are expressed or encoded as RNA i.e. transcript, called as exons alternating with regions that are not encoded in the RNA/transcript called as introns. There are several introns & exons in a gene (I1-E1-I2-E2-I3-E3-I4-E4-I5) and different combinations of exons get encoded into the RNA/ transcript e.g. E1-E2-E3 or E1-E3-E4 or E1-E2 depending on the tissue etc. as written above, hence the difference in sequences and lengths.

ADD COMMENT

Login before adding your answer.

Traffic: 1880 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6