Collapsing my paired-end reads before or after mapping ?

0

Entering edit mode

3.9 years ago

yolek64754 ▴ 30

Hi everyone,

I am about to map some Paired-End (PE) reads (aDNA) and I don't know what is the pro and cons about those two methods:

Run fastp (to remove the left-over adapters) -> map my PE dataset (using BWA) -> single BAM -> remove duplicates using Picard -> Mapping Quality filtering
Run Clip&Merge from EAGER (that collapse reads and trim adapters, specialised for aDNA) -> map my new SE dataset (using BWA) -> remove duplicates using Picard -> Mapping Quality filtering

My question is; is there any advantage for any of those two methods ?

Thanks a lot.

paired-end mapping single-end collapse • 1.7k views

ADD COMMENT • link updated 3.9 years ago by Biostar 20 • written 3.9 years ago by yolek64754 ▴ 30

0

Entering edit mode

aDNA = amplicon? If EAGER is a special pipeline for such data then use that.

ADD REPLY • link 3.9 years ago by GenoMax 147k

0

Entering edit mode

My bad, should not use abbreviations. aDNA stand for ancient DNA, 30-120bp damaged reads.

ADD REPLY • link 3.9 years ago by yolek64754 ▴ 30

0

Entering edit mode

I would still recommend using EAGER since it seems to be designed for the special type of data you have.

ADD REPLY • link 3.9 years ago by GenoMax 147k

0

Entering edit mode

seconding Genomax on this -- why would you not want to use EAGER? Are there any specific reasons why you think alternative ways might be better suited for the task at hand?

ADD REPLY • link 3.9 years ago by Friederike 9.0k

0

Entering edit mode

I meant, correct me if I am wrong but if we collapse reads before mapping, we can loose some information from sequencing errors ? I aDNA most of the time the reads are short so the reads are fully sequenced in Left and Right.

ADD REPLY • link 3.9 years ago by yolek64754 ▴ 30

1

Entering edit mode

EAGER must be taking that into account if you are following a standard protocol. That said every dataset has its own characteristics. You may need to start with #2 and if that does not work then try #1. I would still suggest this route.

ADD REPLY • link 3.9 years ago by GenoMax 147k

0

Entering edit mode

This would be the point where you could dig into the documentation of EAGER to find out how the collapsing is done. Presumably they are leveraging the fact that both reads tend to cover the same piece of a fragment.