I'm just starting out with RNA-seq, and I'm trying to get the installation of Tophat to work. I'm following the "Getting started" instructions on the site, and am currently testing the installation. When I run
tophat -r 20 test_ref reads_1.fq reads_2.fq
... this is what I get:
[2014-06-20 13:01:22] Beginning TopHat run (v2.0.11)
-----------------------------------------------
[2014-06-20 13:01:22] Checking for Bowtie
Bowtie version: 2.2.3.0
[2014-06-20 13:01:22] Checking for Samtools
Samtools version: 0.1.19.0
[2014-06-20 13:01:22] Checking for Bowtie index files (genome)..
Found both Bowtie1 and Bowtie2 indexes.
[2014-06-20 13:01:22] Checking for reference FASTA file
[2014-06-20 13:01:22] Generating SAM header for test_ref
[2014-06-20 13:01:22] Preparing reads
left reads: min. length=75, max. length=75, 100 kept reads (0 discarded)
right reads: min. length=75, max. length=75, 100 kept reads (0 discarded)
[2014-06-20 13:01:22] Mapping left_kept_reads to genome test_ref with Bowtie2
[FAILED]
Error running:
/Users/erikfasterius/bin/bam2fastx --all ./tophat_out/tmp/left_kept_reads.bam|/Users/erikfasterius/bin/bowtie2 -k 20 -D 15 -R 2 -N 0 -L 20 -i S,1,1.25 --gbar 4 --mp 6,2 --np 1 --rdg 5,3 --rfg 5,3 --score-min C,-14,0 -p 1 --sam-no-hd -x test_ref -|/Users/erikfasterius/bin/fix_map_ordering --bowtie2-min-score 15 --read-mismatches 2 --read-gap-length 2 --read-edit-dist 2 --read-realign-edit-dist 3 --index-outfile ./tophat_out/tmp/left_kept_reads.mapped.bam.index --sam-header ./tophat_out/tmp/test_ref_genome.bwt.samheader.sam - ./tophat_out/tmp/left_kept_reads.mapped.bam ./tophat_out/tmp/left_kept_reads_unmapped.bam
... and I don't really understand the error. It doesn't say what type of error it is, just says [FAILED]
and give the long string of whatever caused the error, which I don't understand.
Any help would be greatly appreciated!
Erik
You'll have to run
from the directory yourself to see what the actual error was.
Ok, I did that, and this is what it says:
Interesting, given that the command tells bowtie2 to not provide a header, it's surprising that
fix_map_ordering
would be expecting one. What happens if you remove the--sam-no-hd
portion from that line?This:
Followed by what seems to be the bowtie help (i.e. lots of rows of info on parameters), and then this at the end:
I'm guessing that you missed a pipe, try:
That gives the exact same as above (i.e. "100 reads, of these:" etc). (I'm very much appreciating the help, btw, even tho we/you haven't found the solution yet =P)
At this point I haven't a clue. If you post the reads you're using and let me know the genome they're from then I'll have a look. Barring that, you might try posting this on the google group.
The reads are
reads_1.fq
andreads_2.fq
from thetest_data
that can be downloaded from the Tophat tutorial, which also contains the reference (test_ref.*.bt2).For what it's worth, I can also reproduce this. Presumably this is specific to the test dataset, since otherwise I imagine someone would have reported this already. Either way, do ask on the google group that I linked to, since this really does seem to be a bug.
As an aside, I can recommend using STAR instead. It's vastly faster and produces equivalent to superior results.