Has anyone dealt with Nextera transposase sequence contamination before? Wondering if this is something we should work to eliminate. What would be an acceptable level of contamination?
Thank you
Has anyone dealt with Nextera transposase sequence contamination before? Wondering if this is something we should work to eliminate. What would be an acceptable level of contamination?
Thank you
I just had a sample where 90% of the output was this crap... Even so, after removing it, the data was similar to another technical replicate that did not have the transposase, so I think your levels are usable.
What I used is SeqPrep, so:
SeqPrep -A GTGTATAAGAGACAG -B CTGTCTCTTATACAC -f in_r1.fastq -r in_r2.fastq -1 out_noTP_r1.fastq.gz -2 out_noTP_r2.fastq.gz
I do not know where you got these two sequences (-A and -B) from, but for Nextera, you have to trim CTGTCTCTTATA for both the forward and reverse read. You can have a look here at the Illumina Adapter Guide for the Nextera adapter sequence, if you want to check it in detail.
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Hi all,
I've just found in Nextera™ DNA Sample Prep Kit (Illumina®-Compatible)
Looks like It is not true that this sequence is NOT sequence if we have it in our reads... according to this paper it shouldn't?
Does this could be cased by library contamination?
Best,
Agata
It's not sequenced if your inserts aren't shorter than the read length. Essentially, you only sequence it if you read through the insert in the other adapter. Which matches the observations here: the adapter is reached at the ends of the reads.
What the text you copied means it that it isn't sequenced at the beginning of the read.