Hi,
I have multiple paired-end files from RNA-Seq which I have aligned using hisat2
and then converted to bam via samtools
. Now I am confused whether I should use all the reads (QC-passed) or only properly-paired reads for further analysis.
Please advise.
Thanks.
For differential expression, there's nothing wrong with using all the reads. After all, one mate still mapped and provides information to help quantify gene expression. Paired-reads are great for inferring splicing events, but I'm not sure they help quantify expression significantly better than using single-end reads. You may want to consider whether some of the downstream tools you will use can work with BAM files which have mixed paired and unpaired reads.
@James Ashmore: Ok, thanks.I am planning to use Stringtie and Ballgown