Question

Where to get list of all Illumina adapters

0

Entering edit mode

9.3 years ago

umn_bist ▴ 390

Previously I have been using FastQC to identify overrepresented sequences (adapters) to feed cutadapt tool for trimming.

Right now I have about 50 bam files (sequenced by Illumina) that need quality control including removing adapter seq, but I cannot go through FastQC individually to identify each "unique" adapters.

That said, I am wondering if it's possible to preload a long list of all known Illumina adapters into cutadapt? If so is there a simple text file that holds these adapters? If this is not the best method I am open to alternative solutions. Thank you guys for your help.

RNA-Seq Illumina cutadapt adapters • 16k views

ADD COMMENT • link updated 2.8 years ago by Ram 45k • written 9.3 years ago by umn_bist ▴ 390

Ram · Answer 1 · 2016-02-02

2

Entering edit mode

9.3 years ago

trausch ★ 1.9k

illumina adapter sequences are available here.

ADD COMMENT • link updated 5.3 years ago by Ram 45k • written 9.3 years ago by trausch ★ 1.9k

0

Entering edit mode

Thank you for your reply. Is there a place to download a fasta file of all illumine adapters in RNA seq samples? I had previously been to that link but I am curious if a special format is necessary to build a custom fasta. If not, which libraries are considered relevant for RNA samples? Thanks again.

ADD REPLY • link 9.3 years ago by umn_bist ▴ 390

3

Entering edit mode

PDF seems like the dumbest format ever for a list of primers. Has anyone come across this table in Excel, tsv or some other format that makes sense?

ADD REPLY • link 8.5 years ago by Maximilian Haeussler ★ 1.7k

Ram · Answer 2 · 2016-02-02

1

Entering edit mode

9.3 years ago

Juke34 9.2k

You have to use the -a parameter in cutadapt: http://cutadapt.readthedocs.org/en/stable/guide.html#removing-adapters

-a AdaptorList.fasta

ADD COMMENT • link updated 5.3 years ago by Ram 45k • written 9.3 years ago by Juke34 9.2k

0

Entering edit mode

Thank you for your reply. Does the fasta file for adapter list require a special format? Is it as simple as creating a txt file with a list of all adapters and changing the file extension to fasta? I can only guess cutadapt or any clippers won't know how to differentiate a 3' adapter from a 5' adapter. Which Illumina libraries are considered relevant for RNA samples? The samples were retrieved from CGHub (unaligned) so I don't know which Illumina version these samples used. From the link posted above I found TruSeq, TruSeq short, and Oligonucleotide Sequences for the v1 and v1.5 Small RNA Kits. Thanks again.

ADD REPLY • link updated 5.3 years ago by Ram 45k • written 9.3 years ago by umn_bist ▴ 390

1

Entering edit mode

Does the fasta file for adapter list require a special format?

strange question ... it requires FASTA format.

>TruSeq Adapter, Index 11
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGGCTACATCTCGTATGCCGTCTTCTGCTTG

If you don't know which adaptors have been used, you put all in the fasta file. It should work fine. For differentiation between 3' and 5' adapters, I don't know...

ADD REPLY • link updated 5.3 years ago by Ram 45k • written 9.3 years ago by Juke34 9.2k