I have single end RNAseq data that I have aligned with Tophat2 and now would like to use htseq to generate counts for EdgeR. I have read that it is necessary to sort the accepted_hits.bam
file by name
if using paired ended reads. http://www-huber.embl.de/users/anders/HTSeq/doc/count.html Further, since the default option for order
is name
, if I do not need to sort by name
for single end reads is it necessary to use the pos
option?
Currently, I am using the command below and receive the following counts:
htseq-count -m intersection-nonempty --format=bam tophat_Kalkaska_control/tophat_K18C/accepted_hits.bam PGSC_DM_V403_genes_strand_filtered.gtf > htseq_counts_control/K18C_counts.txt
Will do. Thank you so much.