Hi,
Could anyone please explain how to use Hisat2 when both the genome and transcriptome is available for the same reference strain. The genome is annoated using maker and hence have the gene models in gff format. The transcriptome is also available but is present only as an assembled fasta format. Is it OK to merge both the genome + transcriptome files together and build an index?
Thanks for your time.
Thanks Devon for your quick reply. The % alignment rate was very low (~2-20%) when the sample reads were mapped against the individual genome fasta file, however, the alignment % significantly increased (~60-80%) when the sample reads were mapped against the transcriptome alone. I am unsure which one to use for downstream differential expression analysis using Stringtie etc. ps: I did not use the gff for building index, only used either the genome or transcriptome fasta file for creating index.
Any suggestions please? Many thanks
It sounds like the genome assembly isn't very good. Align against the transcriptome and don't use stringTie, but rather salmon.
Will give Salmon a try, thank you!