Cufflinks analysis for transcriptome assembly
2
0
Entering edit mode
7.2 years ago
qudrat ▴ 100

Sometime while doing Cufflinks for transcriptome assembly it pauses at certain point of time. Does anyone have faced it, please help me

RNA-Seq • 1.4k views
ADD COMMENT
0
Entering edit mode
7.2 years ago

For how long have you waited while it has been paused? It also pauses for me at a certain point during each run, but it always comes back provided that I have enough memory.

What specific command did you run/execute?

ADD COMMENT
0
Entering edit mode

I waited for at least 10 hour and i run the following command Cufflinks -o output -p 3 -M pseudogene.gtf accepted_hits.bam

ADD REPLY
0
Entering edit mode

How much RAM have you got?

3 CPU cores is very little. In the past, I used as many as 32 CPU cores (human genome) and it still took time.

For a bacterial genome 1,000th the size of a human genome, it can still take 2-3 hours for Cufflinks to finish.

ADD REPLY
0
Entering edit mode

Is there any way to split BAM file into two halves?

ADD REPLY
0
Entering edit mode
7.2 years ago

This is most probably because of highly expressed transcripts (like rRNA or those coming from mtDNA). To quickly check it you can summarize reads by featurecounts and see which genes have very abnormally high depth. Then you can mask that genes using –mask-file option in cufflinks http://cole-trapnell-lab.github.io/cufflinks/cufflinks/.

ADD COMMENT
0
Entering edit mode

Actually i run the -mask-file for pseudogene

ADD REPLY
0
Entering edit mode

Overexpressed genes are not necessarily pseudogenes. I my case out of 100mln reads 30mln reads were mapped to mitochondrial gene and cufflinks was getting stuck on that one. But after masking it the problem was solved.

ADD REPLY
0
Entering edit mode

Thank you, let me check it

ADD REPLY

Login before adding your answer.

Traffic: 2508 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6