Cufflinks: Why in the skipped gtf file showed a portion of genome masked
1
0
Entering edit mode
9.9 years ago
mjoyraj ▴ 80

I mapped my RNA-se reads with Tophat with the commands

Tophat -p 8 -o output -I 20000 -g 5 -G gtf_file Genome Fastq1 fastq2

Then I assembled the aligned tophat bam file using cufflinks with the commands

Cufflinks -p 20 -o output accepted.bam file

I found some of the genes are not assembled and they are masked as can be seen in skipped.grf.

I wonder why this has happened. Can anybody suggest something

Cufflinks Assembly RNA-Seq • 3.4k views
ADD COMMENT
2
Entering edit mode
9.9 years ago
  1. That tophat command is appropriate for PE mapping. You need to separate the fastq files with commas (no spaces!).
  2. If coverage is too high then cufflinks will skip a region.
ADD COMMENT
0
Entering edit mode

Yes, I observed the skipped region are mostly my target genes which are highly expressed (as can be seen from mapped reads). So I am trying again cufflinks-assembly with "--max-bundle-frags 1000000000000".

ADD REPLY
1
Entering edit mode

Something like that, I always have to look up cufflinks parameters.

ADD REPLY
0
Entering edit mode

Thanks.... Ryan

ADD REPLY
0
Entering edit mode

Dear Ryan, when I used --max-bundle-frags 1000000000000 in cufflinks, the run continues for a long time and it is still running showing waiting for two threads to complete. Is it okay?. Do I need to increase the number of threads?

ADD REPLY

Login before adding your answer.

Traffic: 2732 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6