Hi all, hope you guys are doing fine in the pandemic.
So I have just recently beginning to move into computational biology stuff (and found it amazing) and one of the first things Im trying to do is to map fastq files against the reference human genome. For that I'm using HISAT2 (because I'm running it on my own PC :/). I have done and ok mapping using the index "genome_tran" available in hisat2 own repository (http://daehwankimlab.github.io/hisat2/download/). But then after discussing a little bit I got really confused regarding the other indexes availabe ("genome" and "genome_snp" for example) and I could not understand the difference between them in the table at the end of the page. Can someone give a light on this? They all map against the genome?
The table:
genome: HISAT2 index for reference; genome_snp: HISAT2 Graph index for reference plus SNPs; genome_tran: HISAT2 Graph index for reference plus transcripts; genome_snp_tran: HISAT2 Graph index for reference plus SNPs and transcripts;
For example, if I'm mapping against genome_snp, will I not get the transcripts levels bc they are only available in "genome_tran"?
Thanks
Hi Vitor, can you please give more details as to what exactly you are trying to study and why you chose Hisat2 for that? You could consider maybe using Kallisto if you are interested in transcript-level abundance.
Hi, I'm interested in analyzing transcripts levels and a little bit of alternative splicing as for example using portcullis (https://portcullis.readthedocs.io/en/latest/) and similar stuff, and therefore I need the .bam files. I have choose hisat2 bc its a little faster than STAR and less resource dependant (for what I have read, and because I'm running at home PC), but thanks for the suggestion.