Separating mixed bacterial 16S rRNA sequences from eukaryotic 18S ITS sequences?

0

8.5 years ago

Lionel12 • 0

A bit of background: I have been given thousands of reads in FASTQ format containing a mix of sequences (16S and 18S ITS) sequenced on an Illumina platform. The adapter indices have already been removed.

The plan is to separate the bacterial and fungal sequences and then BLAST them to determine community composition. Although the adapter indices aren't present, the eukaryotic sequences should contain an Illumina bottom primer sequence as well as the ITSF and ITS2 primer sequences, in both orientations.

What would be a good method to extract the reads that contain these primer sequences, taking into account that some reads will not match exactly, so a degree of error has to be tolerated?

Any help would be greatly appreciated!

ITS sequencing 16S • 2.2k views

updated 2.0 years ago by Ram 45k • written 8.5 years ago by Lionel12 • 0

1

I would use the SILVAngs pipeline to sort out the bacterial and eukaryotic sequences. Just an idea.

8.5 years ago by dago ★ 2.8k

0

Just thinking out loud. These are not definitive solutions:

Why not BLAST the reads as they are (against a database of 16S + 18S ITS sequences)?
Try the BBSplit method described in this thread, using a 16S reference database: "BBSplit syntax for generating builds for the reference genome and how to call different builds". Some bacterial sequences may escape.
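For the primer-based route raised in the question, here is a minimal Python sketch of error-tolerant primer matching. It is only an illustration under several assumptions: the FASTQ file uses plain four-line records, errors are substitutions only (Hamming distance, no indels), and the function names, file paths and `max_mismatches` threshold are all made up for this example. The primers are passed in by the caller, so substitute the ones actually used in the library prep. Dedicated trimming tools such as cutadapt also handle indels and would probably be more robust on real data.

```python
from itertools import islice

COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T, N only)."""
    return seq.translate(COMPLEMENT)[::-1]


def has_fuzzy_match(read, primer, max_mismatches=2):
    """True if `primer` occurs anywhere in `read` with at most
    `max_mismatches` substitutions. Indels are NOT handled."""
    n = len(primer)
    for start in range(len(read) - n + 1):
        mismatches = 0
        for a, b in zip(read[start:start + n], primer):
            if a != b:
                mismatches += 1
                if mismatches > max_mismatches:
                    break  # too many errors in this window, try the next one
        else:
            return True  # window finished without exceeding the threshold
    return False


def read_fastq(path):
    """Yield FASTQ records as lists of 4 lines (assumes 4-line records)."""
    with open(path) as handle:
        while True:
            record = list(islice(handle, 4))
            if len(record) < 4:
                break
            yield [line.rstrip("\n") for line in record]


def split_reads(fastq_path, primers, matched_out, other_out, max_mismatches=2):
    """Write reads containing any primer (either orientation) to `matched_out`
    and all remaining reads to `other_out`. Returns the two counts."""
    primers = [p.upper() for p in primers]
    targets = primers + [reverse_complement(p) for p in primers]
    counts = {"matched": 0, "other": 0}
    with open(matched_out, "w") as matched, open(other_out, "w") as other:
        for record in read_fastq(fastq_path):
            seq = record[1].upper()
            if any(has_fuzzy_match(seq, t, max_mismatches) for t in targets):
                matched.write("\n".join(record) + "\n")
                counts["matched"] += 1
            else:
                other.write("\n".join(record) + "\n")
                counts["other"] += 1
    return counts
```

A call might look like `split_reads("reads.fastq", [its_forward, its_reverse], "its.fastq", "rest.fastq")`, where the two primer variables and the file names are placeholders. Reads in `rest.fastq` are not guaranteed to be bacterial, so checking both files against a reference database afterwards is still worthwhile.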

8.5 years ago by GenoMax 150k
