I get different ENSG IDs for the same MSTRG ID
1
1
Entering edit mode
5.4 years ago
Apex92 ▴ 300

Hi everyone.

I face a problem like I get Different ENSG IDs for the same MSTRG IDs. I used StringTie as the assembler but at the * StringTie_merged.gtf * file I get this problem that I have same MSTRG ID for genes that have a different name and different ENSG IDs. Does it seem that StringTie gives same MSTRG ID for those genes which overlap? How can I solve this problem?

Thank you in Advance.

RNA-Seq Assembly rna-seq alignment next-gen • 1.7k views
ADD COMMENT
0
Entering edit mode

hey, did you find an answer to your question? I have the same question

ADD REPLY
1
Entering edit mode
4.2 years ago

You seem to have cases of merged genes. Basically some genes are joined together by StringTie/Cufflinks because of genomimc overlap of associated transcripts.

To solve this I have just release an update to the R package IsoformSwitchAnalyzeR (available in >1.11.6) which can fix problem 1 and 2 for most genes. You simply use the importRdata() function - which will fix the isoform annotation which is fixable and clean up the rest of the annotation. From the resulting switchAnalyzeRList object you can analyse isoform switches with predicted functional consequences with IsoformSwitchAnalyzeR or use extractGeneExpression() to get a gene count matrix for DE analysis with other tools.

Hope this helps.

Cheers

Kristoffer

ADD COMMENT

Login before adding your answer.

Traffic: 2235 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6