Question

Poly-A trimming - is it necessary?

0

Entering edit mode

5.7 years ago

ika ▴ 50

I see poly-A trimming listed as a recommended step in many RNA-Seq protocols. However, I can imagine that trimming e.g. all As from the ends of a sequence could ultimately lead to false results if what is being trimmed is not a poly-A tail but a repetitive A sequence within mRNA. This might cause reads to align non-uniquely or trim reads to a length that would be filtered before alignment. While trimming all A's is not the typical approach I've seen, this would also theoretically be possible if A k-mers were used. I couldn't find a research paper where this was tested.

What is your opinion on this?

Is there maybe a k-mer length where poly-A trimming specificity is optimal?

Thanks for any input!

RNA-Seq rna-seq alignment sequencing mrna • 4.4k views

ADD COMMENT • link updated 5.4 years ago by lieven.sterck 16k • written 5.7 years ago by ika ▴ 50

1

Entering edit mode

Scan data with fastqc if polyA contamination is an issue. If not, so not coming up as overrepresented sequence, don't do any trimming. Same goes for adapter contamination. Only act if it is an issue. Otherwise leave data as is.

ADD REPLY • link 5.7 years ago by ATpoint 90k

score 2 · Answer 1 · 2020-02-25

I personally never did (or even heard about doing) specific poly-A trimming of reads.

In any case, nowadays most aligners are able to soft-clip the reads when aligning them and as such the polyA stretch will likely never cause any issue as it is not derived from the genome. Moreover, trend is to do less and less pre-filtering on reads

bottom-line I wouldn't care to much about it