Is this a bug of Cutadapt
1
0
Entering edit mode
10.1 years ago
zdyuan • 0

Hi,

I found I could't remove the 5' adapters of my fastq file by using Cutadapt. Is there any bugs or could you give me some suggestions to remove the adapters?

Thanks!

cutadapt --quality-base=33 -q 20 -a TGGAATTCTCGGGTGCCAAGG -g GTTCAGAGTTCTACAGTCCGACGATC -m 18 -M 26 C101.fastq -o C101_rm_adapt.fastq
cutadapt --quality-base=33 -q 20 -a TGGAATTCTCGGGTGCCAAGG -b GTTCAGAGTTCTACAGTCCGACGATC -m 18 -M 26 C101.fastq -o C101_rm_adapt_test.fastq
cutadapt --quality-base=33 -q 20 -a TGGAATTCTCGGGTGCCAAGG -b TTCTACAGTCCGACGATC -m 18 -M 26 C101.fastq -o C101_rm_adapt_test02.fastq

$ zcat C101_rm_adapt.fastq.gz|grep TTCTACAGTCCGACGATC | wc -l
2685599

$ grep TTCTACAGTCCGACGATC C101_rm_adapt_test.fastq | wc -l
2685599

$ grep TTCTACAGTCCGACGATC C101_rm_adapt_test02.fastq | wc -l
2828775

$ zcat C101_rm_adapt.fastq.gz|grep TTCTACAGTCCGACGATC | head

TTCTACAGTCCGACGATC
GTTCTACAGTCCGACGATC
TTCTACAGTCCGACGATC
TTCTACAGTCCGACGATC
TTCTACAGTCCGACGATC
TTCTACAGTCCGACGATC
TTCTACAGTCCGACGATC
GAGTTCTACAGTCCGACGATC
NGTTCTACAGTCCGACGATC
TTCTACAGTCCGACGATC

$ grep TTCTACAGTCCGACGATC C101_rm_adapt_test02.fastq | head

TTCTACAGTCCGACGATC
GTTCTACAGTCCGACGATC
TTCTACAGTCCGACGATC
TTCTACAGTCCGACGATC
TTCTACAGTCCGACGATC
TTCTACAGTCCGACGATC
TTCTACAGTCCGACGATC
GAGTTCTACAGTCCGACGATC
AGAGTTCTACAGTCCGACGATC
NGTTCTACAGTCCGACGATC

$ grep TTCTACAGTCCGACGATC C101_rm_adapt_test.fastq | head

TTCTACAGTCCGACGATC
GTTCTACAGTCCGACGATC
TTCTACAGTCCGACGATC
TTCTACAGTCCGACGATC
TTCTACAGTCCGACGATC
TTCTACAGTCCGACGATC
TTCTACAGTCCGACGATC
GAGTTCTACAGTCCGACGATC
NGTTCTACAGTCCGACGATC
TTCTACAGTCCGACGATC
softwareerror • 3.0k views
ADD COMMENT
0
Entering edit mode

Can you please post the stats that cutadapt provides after execution? I think it writes them to STDERROR.

ADD REPLY
0
Entering edit mode

In addition please simplify the example to a single simple, minimal data run that shows the problem. The current output is way too complicated to follow and you are in fact reusing a filename - though that alone would not explain the problem.

ADD REPLY
0
Entering edit mode

There is no STDERRORs.

ADD REPLY
0
Entering edit mode
10.1 years ago
zdyuan • 0

I found Cutadapt couldn't clean the adapter sequences completely when both using -a and -g, but could trim the adapter step by step (test.fastq could be download from here):

## At the same time: `-a` and `-g`

$ cutadapt --quality-base=33 -q 20 -a TGGAATTCTCGGGTGCCAAGG -g GTTCAGAGTTCTACAGTCCGACGATC -m 18 -M 26 test.fastq -o test_rm_adapt.fastq

$ grep TTCTACAGTCCGACGATC test_rm_adapt.fastq |wc -l
418

## successively:`-a`, `-g`

$ cutadapt --quality-base=33 -q 20 -a TGGAATTCTCGGGTGCCAAGG test.fastq -o test_rm_adapt_a.fastq

$ cutadapt --quality-base=33 -q 20 -g GTTCAGAGTTCTACAGTCCGACGATC -m 18 -M 26 test_rm_adapt_a.fastq -o test_rm_adapt_g.fastq


$ grep TTCTACAGTCCGACGATC test_rm_adapt_a.fastq |wc -l
448
$ grep TTCTACAGTCCGACGATC test_rm_adapt_g.fastq |wc -l
0
ADD COMMENT
0
Entering edit mode

what is this GTTCAGAGTTCTACAGTCCGACGATC adapter?

ADD REPLY

Login before adding your answer.

Traffic: 1672 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6