Removing Duplicates for Variant calling when using genes as a reference?
1
0
Entering edit mode
7.2 years ago
DR ▴ 10

Hello everybody!

I am planning to perform variant calling on a gene catalog that I retrieve from metagenomic samples. I wonder how would duplicates removal or marking could affect this. The point is that my reads have overall read length of 250bp ,thus I thought high proportion of secondary alignments may be real alignments to neighboring genes rather than real PCR duplicates during library preparation and I do not know to what extent removing would decrease sensitivity for detection. Do you have any suggestion?

Many thanks!

SNP next-gen sequencing gene • 2.4k views
ADD COMMENT
0
Entering edit mode
7.2 years ago

Hi,

The removal of PCR / optical duplicates is a topic that always makes for a good debate. The exact wet-lab process that was performed is critical.

What I suggest that you do is first detect PCR duplicates with Picard MarkDuplicates and then gauge whether or not you should remove them. In some circumstances, again based on the wet-lab process, it's just not feasible or correct to remove duplicates.

At the end of the day, if you do extensive testing with and without duplicates, I think that you'll find that your results will mostly stay the same.

Kevin

ADD COMMENT

Login before adding your answer.

Traffic: 1930 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6