Joining Paired-End Illumina Raw Reads
3
2
Entering edit mode
10.7 years ago
Yongjie Zhang ▴ 110

I recently have amplicons sequenced (Illumina PE250) to investigate microbial community. I want to know what quality detection, and what kind of trims of raw reads by what programs should be performed before joining paired-end raw reads? I think low-quality parts, potential barcode or primer sequences existed in raw reads should be excluded in advance. Is this right?

illumina paired-end • 8.7k views
ADD COMMENT
1
Entering edit mode

I finally merged reads pairs using usearch -fastq_mergepairs with -fastq_allowmergestagger enabled, but before merging, I cut primer/adapters using CutAdapt and trimed low-quality regions from ends of reads using Trimmomatric, and after merging, I cut potential primer/adapter again by CutAdapt and filtered reads with >0.5 expected errors by usearch -fastq_filter. Then I can do downstream analyses.

ADD REPLY
1
Entering edit mode
10.7 years ago
Janake ▴ 170

Here is a good example to start: http://www.mothur.org/wiki/MiSeq_SOP

ADD COMMENT
0
Entering edit mode

Very useful link. Thanks. I found the join_paired_ends.py in Qiime is not very ideal to my data. Mothur might be better.

ADD REPLY
0
Entering edit mode
10.7 years ago
Danielk ▴ 640

You could use SeqPrep for this. see https://github.com/jstjohn/SeqPrep or my fork https://github.com/dakl/SeqPrep which adds two parameters for more stringent handling of mismatches (set to N, essentially).

ADD COMMENT
0
Entering edit mode
10.7 years ago
Joseph Hughes ★ 3.0k

You should be able to find everything you need for removing adapters, splitting the sample based on barcodes and joining the pairs with a combination of the tools in the fastx_toolkit and ea-utils.

ADD COMMENT

Login before adding your answer.

Traffic: 2152 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6