Question

PacBio long read error correction tools

7.1 years ago

freddiejung ▴ 60

Dear all,

In my previous post (https://www.biostars.org/p/303214/#303734), some people suggested that it is better to correct errors in long reads from the PacBio Sequel using Illumina short reads before genome assembly.

I have found that the assembly built from short reads alone contains many mis-assembled loci: the assembled sequences do not match our FISH data. We currently suspect that extremely similar repeats dispersed across the genome caused the mis-assembly.

In this case, I suspect that error-correction software such as LoRDEC or HALC, which needs an assembly derived from short reads as input, would not work well. Is this right?

If so, which software would be better suited to correcting the errors?

PacBio Long-read error-correction short-read • 2.8k views

updated 7.1 years ago by lieven.sterck 15k • written 7.1 years ago by freddiejung ▴ 60

What is the scientific/biological question you are looking for an answer to?

Depending on your question, different initial error-correction bioinformatics pipelines are recommended for PacBio data.

7.1 years ago by tjduncan ▴ 280

Has anyone used FMLRC (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5807796), or is HALC better?

6.7 years ago by Ric ▴ 440

score 0 · Answer 1 · 2018-03-15

7.1 years ago

lieven.sterck 15k

The built-in error correction procedure of the Canu pipeline works quite well. Note, however, that it uses only the PacBio reads, not Illumina data.
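As a rough illustration only, a correction-only Canu run could be assembled along these lines. This is a minimal sketch that builds the command line without executing it. The options shown (`-correct`, `-p`, `-d`, `genomeSize=`, `-pacbio-raw`) follow the Canu 1.x documentation as I recall it; newer releases are reported to use `-pacbio` instead of `-pacbio-raw`, so check `canu -h` for your version. The helper name, prefix, output directory, genome size, and read file below are all hypothetical placeholders.

```python
import shlex


def build_canu_correct_command(prefix, out_dir, genome_size, reads_path):
    """Return the argument list for a correction-only Canu run.

    Assumes Canu 1.x style options; this helper is illustrative and
    not part of Canu itself.
    """
    return [
        "canu", "-correct",           # stop after the read-correction stage
        "-p", prefix,                 # prefix for output file names
        "-d", out_dir,                # working/output directory
        f"genomeSize={genome_size}",  # estimated genome size, e.g. "500m"
        "-pacbio-raw", reads_path,    # uncorrected PacBio reads (Canu 1.x flag)
    ]


if __name__ == "__main__":
    # All values here are placeholders, not taken from the original post.
    cmd = build_canu_correct_command(
        "sample", "canu_correct", "500m", "pacbio_reads.fastq.gz"
    )
    # Print the command so it can be inspected before running it by hand.
    print(shlex.join(cmd))
```

The command is only printed, not run, so the options can be compared against the installed Canu version first.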

7.1 years ago by lieven.sterck 15k