Question

hisat2 salmon alignment error

0

Entering edit mode

4.7 years ago

evelyn ▴ 230

Hello Everyone,

I have made bam files using hisat2 and then I am trying to use these bam files for salmon alignment.

hisat2-build genome.fa hisat
hisat2 -x hisat -1 R1_paired.fastq.gz -2 R2_paired.fastq.gz -S out.sam
samtools view -bS out.sam > out.bam

java -Xms1g -Xmx3g -jar picard.jar MarkDuplicates \
I=out.bam \
O=marked_duplicates.bam \
M=metrics.txt

salmon quant -t transcripts.fasta -l A -a marked_duplicates.bam -o out.sf

But it gives an error:

Please provide a reference FASTA file that includes all targets present in the BAM header
If you have access to the genome FASTA and GTF used for alignment 
consider generating a transcriptome fasta using a command like: 
gffread -w output.fa -g genome.fa genome.gtf

Thank you for any help!

RNA-Seq • 2.5k views

ADD COMMENT • link updated 4.7 years ago by Rob 6.9k • written 4.7 years ago by evelyn ▴ 230

score 1 · Answer 1 · 2020-03-27

1

Entering edit mode

4.7 years ago

Rob 6.9k

You aligned your reads to the genome with HISAT2, and you are trying to quantify the transcriptome with salmon. The references are not the same, and salmon does not (currently) support consuming genome-aligned BAM files for transcriptome quantification. You either need to align the reads directly against the transcriptome (which could be done with HISAT2, or with a tool like Bowtie2), or use the STAR aligner and take advantage of it's built-in option to project genomic alignments onto transcriptomic coordinates using the --quantMode TranscriptomeSAM option.

ADD COMMENT • link 4.7 years ago by Rob 6.9k

0

Entering edit mode

Thank you! Can --tmo/--transcriptome-mapping-only from hisat2 do the same thing as --quantMode TranscriptomeSAM from STAR.

ADD REPLY • link 4.7 years ago by evelyn ▴ 230

1

Entering edit mode

No, this will simply prevent HISAT2 from aligning reads outside of annotated transcripts. These will still be spliced alignments in genomic coordinates. As far as I know, HISAT2 has no option to project genomic alignments to transcriptomic coordinates, so the only way to accomplish this with HISAT2 would be to align the reads directly against the transcriptome rather than the genome.

ADD REPLY • link 4.7 years ago by Rob 6.9k

0

Entering edit mode

Thank you! I am using transcriptome to align the reads. The .bam files are made but I saw this error in the log file.

(ERR): hisat2-align died with signal 9 (KILL)

Are these .bam files okay to use further.

ADD REPLY • link 4.7 years ago by evelyn ▴ 230

0

Entering edit mode

Why do you even align the data? I suggested already in a previous thread to use salmon directly on the fastq files. There is no need for alignment if you quantify with salmon.

ADD REPLY • link 4.7 years ago by ATpoint 86k