Question

How to remove non-unique mapping reads with Hisat?

1

Entering edit mode

9.4 years ago

Niek De Klein ★ 2.6k

Is there an option in Hisat that is similar to -g/--max-multihits in Tophat? I can not find it in the parameter list on https://ccb.jhu.edu/software/hisat/manual.shtml#options. There is a Uniqueness class in here: https://github.com/infphilo/hisat/blob/master/unique.h, but when I grep for that class or the bestIsUnique function in the Hisat source folder I can't find it used anywhere.

Or would it work to use samtools view -q 1?

If anyone else is interested, I e-mailed the author and he said:

One possible solution is to use the next best alignment score (ZS:i:, previously XS:i:). If the next alignment score is the same as the primary alignment score (AS:i:), that means there are at least two equivalent alignments found by HISAT. This additional SAM field (ZS:i:) is only available at HISAT github, which I'll include in the next release of HISAT.

alignment RNA-seq hisat • 8.3k views

ADD COMMENT • link updated 23 months ago by Ram 44k • written 9.4 years ago by Niek De Klein ★ 2.6k

1

Entering edit mode

I am currently not using Hisat after finding that it's treatment of multi-mappers is somewhat lacking. I guess this is some kind of speed vs. accuracy trade-off. Personally I prefer an algorithm that is slower but guarantees to find the best N mappings in the genome.

ADD REPLY • link 9.4 years ago by Christian ★ 3.1k

0

Entering edit mode

9.4 years ago

vivekbhr ▴ 700

There is the -k option to inquire a given number of alignments as output. However,since HISAT doesn't finds/reports alignments in a specific order, the -k cutoff won't necessarily give you "best" alignments. So I would not recommend using -k 1 if you want to find the best unique alignment for each read, but it's faster if you want to know whether a reads aligns or not.

Hope this helps.

ADD COMMENT • link 9.4 years ago by vivekbhr ▴ 700

0

Entering edit mode

8.5 years ago

bimbam42 • 0

Hej but doesn't -k 1 mean that it will go for the first hit rather than the best? Does this also apply for HISAT2?

ADD COMMENT • link 8.5 years ago by bimbam42 • 0

score 5 · Accepted Answer · 2015-07-08

5

Entering edit mode

9.4 years ago

thomas.smith2 ▴ 120

The most recent release of hisat (v0.1.6) uses NH tags in the BAM to specify the number of alignments for the read/pair (see release notes: https://ccb.jhu.edu/software/hisat/index.shtml). As far as I understand it, reads/pairs with NH:i:1 have a single best alignment and are therefore "uniquely" mapped.

ADD COMMENT • link 9.4 years ago by thomas.smith2 ▴ 120