Hello,
I'm new to bioinformatics, I have a question which may seem easy. For my RNA-seq analyses, I was using HISAT2 with already-indexed reference genome downloaded from HISAT2 website for Homo sapiens. And as GTF I was using the one I downloaded from Ensembl. After some runs, I thought this usage might be wrong, because I downloaded reference genome and GTF from different sources (HISAT2 website does not have GTF) and they might not be matched. So I've decided to download a new reference genome from Ensembl and index it myself, and using the corresponding GTF file from Ensembl with the right version number (v93). Is there any incompatibility between my previous GTF Homo_sapiens.GRCh38.92.gtf
and reference genome? How should I use them?
Always use the same version of the genome and the annotation.
simple suggestion go for gencode you can find the genome assembly and gtf file at the same place of various corresponding versions
Usually for any incompatibility, HISAT2 report warning or errors. But as juke-34 pointed out correctly, it is always advisable to use same version of genome and annotation file to avoid any further error.
The "tran" versions of the indexed genomes on the HISAT2 website contains the GTF annotation files. Its probably easier to use these. Link to download GRCh38 index + GTF.
It is really easy to use them, but isn't it outdated? (It was released more than 2 years ago according to its date on ftp)
Now, there are 3 ways that I consider to use the reference genome. I'm really confused right now.
gencode.v28.annotation.gtf
, FASTA file which contains all chromosomes (Genome sequence (GRCh38.p12):GRCh38.p12.genome.fa
)How should I evaluate these options? HISAT2 option seems the most comfortable one, but I don't want to use outdated one.
Hi Batu,
I am stuck in the same situation as yours and surprised how come no one else is reporting this incompatibility. I am using HISAT2 GrCh38 release 84 version and using same version of GrCh38.84 version of gtf from Ensembl to be sure. But it is so outdated as the latest version now is 97! Please update the thread if you have found a solution.
Thank you.