How to get Ensembl ids in cuffdiff output?
7.8 years ago
LuisNagano ▴ 90

Hello everyone, is there a way to run the Cufflinks/Cuffdiff pipeline so that the output contains the Ensembl ID (or RefSeq ID, or Entrez ID) in addition to the gene name?

I ran TopHat with the Ensembl Bowtie 2 index "GRCm38_MusMusculus" and used "gencode.vM12.annotation.gtf" as the reference annotation for Cufflinks and Cuffmerge, but my final Cuffdiff output (gene_exp.diff) contains only gene names and XLOC ids (screenshot not available).

Thank you for your help!

Cufflinks Cuffdiff RNA-seq Ensembl id • 3.4k views
7.8 years ago
shunyip ▴ 250

You can add a gene annotation file (.gtf) that contains Ensembl transcripts to your input, using the -g argument.


I used Ensembl and UCSC .gtf files to run Cufflinks and Cuffmerge, but it did not help: I still get only XLOC ids in my Cuffdiff output. The Ensembl ids and nearest_ref ids appear in my Cufflinks and Cuffmerge output, but they are missing from the final Cuffdiff output, which is the one I need.

What command line do you use to run cufflinks, cuffmerge and cuffdiff?

My cufflinks output, transcripts.gtf: (screenshot not available)

My cuffmerge output, merged.gtf: (screenshot not available)

My cuffdiff output, gene_exp.diff: (screenshot not available)

My cuffdiff output, genes.fpkm_tracking: (screenshot not available)


Cuffmerge assigned new IDs to your genes because it merges the assembled transcripts, which changes the gene models in the output. Because the genes' coordinates have been edited, they are no longer equivalent to the Ensembl genes, so the Ensembl IDs should not be used for them.

Usually, when I use cuffmerge, I only want to find novel genes. In that case, I use bedtools to find genes that do not overlap anything in the Ensembl gtf file, so I have never needed to match the IDs.

In your case, maybe you can append the novel genes (obtained with the -v option of bedtools intersect) to the Ensembl gtf and then use the combined file for cuffdiff?
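
To make the bedtools idea above concrete: `bedtools intersect -v` reports the entries of one file that overlap nothing in the other. A minimal pure-Python sketch of that filter is shown here, only to illustrate the logic. The function name `novel_intervals` and the `(chrom, start, end)` tuple layout are assumptions made for this example, not part of bedtools or Cufflinks, and strand is ignored (as bedtools does unless asked otherwise).

```python
from collections import defaultdict


def novel_intervals(candidates, reference):
    """Return candidate intervals that overlap no reference interval.

    Both arguments are iterables of (chrom, start, end) tuples using
    1-based, inclusive coordinates, as in a GTF file. Strand is ignored.
    This is a simple O(n * m) scan per chromosome, fine for a sketch
    but slower than bedtools on large annotations.
    """
    # Group the reference intervals by chromosome for quick lookup.
    by_chrom = defaultdict(list)
    for chrom, start, end in reference:
        by_chrom[chrom].append((start, end))

    novel = []
    for chrom, start, end in candidates:
        # Two closed intervals overlap when each starts before the other ends.
        overlaps = any(
            start <= ref_end and ref_start <= end
            for ref_start, ref_end in by_chrom.get(chrom, ())
        )
        if not overlaps:
            novel.append((chrom, start, end))
    return novel
```

The kept intervals would correspond to the "novel" genes that could be appended to the Ensembl gtf. For real data bedtools itself is the better tool, since it handles sorting, strand options and large files.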


I had a similar problem with not getting appropriate gene IDs in my cuffdiff output; see the following post:

How to retrieve gene ID, gene description, after cuffdiff analysis

Maybe the suggestions given there will help you.
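
Since the thread reports that merged.gtf still carries the reference ids next to the XLOC ids, one possible workaround is to build an XLOC-to-reference lookup from merged.gtf and join it onto gene_exp.diff afterwards. The sketch below is only an illustration under assumptions: the attribute names (`gene_id`, `nearest_ref`) and the `gene_id` column of gene_exp.diff should be checked against your own files, and the function names are made up for this example. Note also that `nearest_ref` appears to hold a reference transcript id, so a second lookup against the reference gtf may be needed to reach gene ids, and the caveat above about merged gene models still applies.

```python
import csv
import re
from collections import defaultdict

# Matches GTF attribute pairs of the form: key "value";
ATTR_RE = re.compile(r'(\S+) "([^"]*)"')


def xloc_to_reference_ids(gtf_lines, ref_attr="nearest_ref"):
    """Map each gene_id (e.g. an XLOC id) to the set of reference ids
    found in the given attribute of a merged GTF."""
    mapping = defaultdict(set)
    for line in gtf_lines:
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 9:
            continue  # not a well-formed GTF record
        attrs = dict(ATTR_RE.findall(fields[8]))
        gene_id = attrs.get("gene_id")
        ref_id = attrs.get(ref_attr)
        if gene_id and ref_id:
            mapping[gene_id].add(ref_id)
    return mapping


def annotate_diff(diff_lines, mapping, column="gene_id"):
    """Yield rows of a tab-separated cuffdiff table with an extra
    'reference_ids' field (comma-separated, empty if nothing matched)."""
    reader = csv.DictReader(diff_lines, delimiter="\t")
    for row in reader:
        ids = mapping.get(row[column], ())
        row["reference_ids"] = ",".join(sorted(ids))
        yield row
```

Usage would be along the lines of opening merged.gtf and gene_exp.diff, passing the first file object to `xloc_to_reference_ids`, and iterating over `annotate_diff(diff_file, mapping)`. Genes that receive an empty `reference_ids` field have no reference match recorded in the gtf.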
