Question

De-Novo Genome Assembly Vs Comparative Genome Assembly

2

Entering edit mode

13.5 years ago

Saad Khan ▴ 440

Which of the two genome assembly methods is better (more accurate) in cases where there are many potential reference genome available for a particular newly sequenced genome. Also if all the reference genomes belong to same genus how many of them should be considered while carrying out genome assembly.

assembly genome • 7.1k views

ADD COMMENT • link updated 11.3 years ago by Rohit ★ 1.5k • written 13.5 years ago by Saad Khan ▴ 440

score 4 · Answer 1 · 2011-06-15

The approach I used for our 454 sequence reads was de novo assembly followed mapping the resulting contigs against a reference genome using nucmer. I assemble de novo first to prevent any inversions or repeats in any reference genome making their way into my build.

One question is what effect do inversions present in a reference genome, but not in your genome, have on comparative assembly? I imagine it depends on the weighting comparative assembly software gives to read-overlap or paired-read data versus reference genome alignment. A software suite like AMOS does allow you to combine different genome builds together so it's worth experimenting perhaps?

score 2 · Answer 2 · 2011-06-15

2

Entering edit mode

13.5 years ago

Martin A Hansen 3.0k

Both approaches have issues, and it is not possible to say if one is the better. De-novo assembly fails in repetitive regions. Mapping assembly miss segmental insertions and rearrangements. You can try both approaches and see how the results compare. To my knowledge there is no software that does this in one go - which is a shame.

ADD COMMENT • link 13.5 years ago by Martin A Hansen 3.0k

2

Entering edit mode

And, it depends on the organism! Some organisms are much easier to assemble than others, in one project, two bacteria in the same family had very different assembly results - one assembling into a couple of contigs, the other ending up very fragmented. I'm with the "try both" camp.

ADD REPLY • link 13.5 years ago by Ketil 4.1k

score 1 · Answer 3 · 2011-06-15

1

Entering edit mode

13.5 years ago

Felix ▴ 90

It also depends on the sequencing depth of the newly sequenced genome. If you only have a low coverage (maybe <6x) the de-novo assembly quality will be dubious. I believe combining the two approaches would be the better strategy in this case.

ADD COMMENT • link 13.5 years ago by Felix ▴ 90

score 0 · Answer 4 · 2013-08-15

0

Entering edit mode

11.3 years ago

Rohit ★ 1.5k

If you have time, try to run a de novo assembly and then try to map it to a Reference.

Segemehl is one tool that helps out in finding insertions and deletions.

http://www.bioinf.uni-leipzig.de/Software/segemehl/

ADD COMMENT • link 11.1 years ago by Rohit ★ 1.5k