Difference Between Gene And Transcript In The Datasets
2
1
Entering edit mode
13.2 years ago
Ss ▴ 50

I have been working with different datasets for a month now but I get easily confused.

I need a dataset for genes in humans and I tried downloading it via ftp of different databases.

Ok, I understand there could be different exon, and splicing can result in different transcripts. But then it does have same start (bp) and end(bp). Shouldn't it be different among all the transcripts or I am missing something there?

I downloaded a dataset and a subset looks like this

Ensembl Gene ID    Ensembl Transcript ID    Chromosome Name    Gene Start (bp)    Gene End (bp)    Strand    Associated Gene Name    Associated Transcript Name    Status (gene)
ENSG00000016402    ENST00000316649    6    137321108    137366298    -1    IL20RA    IL20RA-001    KNOWN
ENSG00000016402    ENST00000468393    6    137321108    137366298    -1    IL20RA    IL20RA-002    KNOWN
ENSG00000016402    ENST00000367746    6    137321108    137366298    -1    IL20RA    IL20RA-003    KNOWN
ENSG00000016402    ENST00000461799    6    137321108    137366298    -1    IL20RA    IL20RA-005    KNOWN
ENSG00000016402    ENST00000460306    6    137321108    137366298    -1    IL20RA    IL20RA-004    KNOWN
ENSG00000016402    ENST00000541547    6    137321108    137366298    -1    IL20RA    IL20RA-202    KNOWN

In this example, chr_start and chr_end are same for all transcripts.

database gene transcript ensembl biomart • 10k views
ADD COMMENT
7
Entering edit mode
13.2 years ago

As it is written in the header "Gene Start (bp) Gene End (bp)" , your columns display the start and end position of the GENE (not the TRANSCRIPTs). You can clearly see here that all those transcripts (having different transcription start/end) belong to the same GENE (ENSG00000016402) on chromosome 6: chr6:137321108-137366298

ADD COMMENT
1
Entering edit mode
11.7 years ago
Raghav ▴ 100

As you mentioned, Difference between gene and transcript in the database, if we talking about gene in data base it means it have TSS position with 5'UTRs start codon............termination codon with 3'urts which have comparatively larger than our transcript which basically contains only CDS information that gene.

ADD COMMENT

Login before adding your answer.

Traffic: 1759 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6