Find structural variants comparing two assemblies
1
3
Entering edit mode
5.2 years ago
crimsontabaq ▴ 70

Is there a rapid way of comparing two large assemblies quantitatively, like generating a file with all large structural variants, or SV - gaps, inversions, etc., and theirs coordinates? The idea is basically to find out what one assembler particularly failed to assemble, compared to another one. Not using reads but assemblies alone, e.g. by breaking one of them on small pieces and use them as pseudo-reads.

There are ways of comparing assemblies, but they are not precise (e.g. dotplot, just visualizes but doesn't provide number and type of SV) and are finite (e.g. GMASS score or likehood test) - one cannot analyze what is different, but just HOW different.

This is simple task and quite typical for evaluation of assemblers' output, but I couldn't find any working solution. GATK finds only small SVs and poor at finding inversions, mummer's show-diff has the same problems + running two long on large genomes.

Assembly genome comparative genomics • 1.8k views
ADD COMMENT
1
Entering edit mode

You could combine something like aligning the de-novo contigs to your reference with minimap2 and then using https://github.com/eldariont/svim

ADD REPLY
1
Entering edit mode
ADD COMMENT

Login before adding your answer.

Traffic: 2137 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6