Hi,
I am dealing with RNA-seq PE data. I see some of my samples have poor quality bases at the end of reverse reads. How does TopHat deal with such reads? Are these bases clipped while mapping? and how does that affect the mapping quality?
Thank you
AFAIK tophat does not perform soft or hard clipping. So, you'll get better read mapping if you clip the low-quality bases yourself. The way I do it is to perform clipping from the end of the read and retain the read if the clipped read length is >= 50 bases, else remove it.