Question

How to remove poly T in RNA-sequencing data

3

Entering edit mode

9.9 years ago

seta ★ 1.9k

Hi all,

I'm using Trim_galore tool for quality and adaptor trimming, but I found no option to remove poly T in the sequencing data resulted from poly A enrichment libraries sequencing. Could anybody please let me know how should remove poly T in data? Thanks

RNA-Seq alignment Assembly sequencing • 13k views

ADD COMMENT • link updated 2.4 years ago by Ram 45k • written 9.9 years ago by seta ★ 1.9k

1

Entering edit mode

Have you tried mapping the RNA-seq data yet? It may not be necessary to remove polyA stretches because they won't map to a unique location and you may already have enough reads to not worry about it. Another poster asked a similar question: Statistics About Poly-A Tails In Rnaseq Reads .

However there are other posts that may help you such as: Trim Poly-A Tail And Trailing Nucleotides

ADD REPLY • link updated 2.4 years ago by Ram 45k • written 9.9 years ago by Jason ▴ 940

Ram · Answer 1 · 2015-06-29

1

Entering edit mode

9.9 years ago

h.mon 35k

It is not clear-cut trimming poly-A tails is beneficial, are you certain you need to trim the poly-A tail? As Jason pointed, many short read mappers won't be affected by poly-A tails. For assembly, it may help discern splice variants (see comment on MIRA's manual).

ADD COMMENT • link updated 2.4 years ago by Ram 45k • written 9.9 years ago by h.mon 35k

Ram · Answer 2 · 2015-08-05

There is a tool available on Github for removing PolyA, PolyT, PolyC, PolyG

https://github.com/haploxer/after

Automatic Filtering, Trimming, and Error Removing for fastq data
Currently it supports Illumina 1.8 or newer format
AFTER can simply go through all fastq files in a folder and then output a good folder and a bad folder, which contains good reads and bad reads of each fastq file

Besides remove PolyX, it also can do:

Trim reads at front and tail according to bad per base sequence content
Detect and eliminate bubble artifact caused by sequencer due to fluid dynamics issue
Filter low-quality reads

Ram · Answer 3 · 2015-06-30

0

Entering edit mode

9.9 years ago

michael.ante ★ 4.0k

You can use bbduk from the bbmap suite. Just create a polyA.fa (e.g. >polyA\nAAAAAAAAAAAAA) and zip it into the bbmap resources folder; run then bbduk with ref=resources/polyA.fa.gz. Maybe it is necessary to add a polyT sequence into your fasta.

I'm quite sure you can add the sequences to the adapter-file of trim_galore likewise.

Cheers,
Michael

ADD COMMENT • link updated 2.4 years ago by Ram 45k • written 9.9 years ago by michael.ante ★ 4.0k

0

Entering edit mode

Thanks for all help.

ADD REPLY • link 9.9 years ago by seta ★ 1.9k

Ram · Answer 4 · 2016-06-12

0

Entering edit mode

8.9 years ago

goubert.clement ▴ 30

Hello,

You can also try UrQt.

It performs poly-N trimming as well as quality trimming, searching for the best larger fragment in the whole read. It has a high percentage of base conservation. It is pretty effective.

ADD COMMENT • link updated 2.4 years ago by Ram 45k • written 8.9 years ago by goubert.clement ▴ 30