Hi All,
I am new to RNA-seq and am looking for good practices for aligning RNA-seq data with different read lengths (2x50, 2x75, 2x100). The idea is to use STAR 2-pass (2.7.10b) as an aligner and RSEM for gene and transcripts quantifications, eventually using tximport for read count extraction. I read many forums, groups, the ENCODE and NCI GDC pipelines, and the STAR reference manual. However, the number of parameters that can be tuned is overwhelming. I know I should set the --sjdboverhang to length - 1 for genome generation (which can be omitted in the two STAR passes), but rather than this parameter, is there something else to which I should pay attention?
I appreciate any help you can provide.