I'm trying to map short reads to a genome graph constructed from multiple whole genome alignments of A. thaliana. What is the best practice method to produce the alignments in a reasonable time frame. I've been mapping just a subset of 5000 pairs to a single chromosome graph. vg map takes >24h to complete and enormous resources (up to 500G of RAM on a single core). Mapping a full sequencig run to the full genome graph took over 7 days to complete. Any help would be appreciated. Thanks!!!
Hello, can i have your feedback about this question please ? and what worked the best for you
Speed is still an issue with
vg map
. If your graph is complicated, it can be intractable to map read pairs withvg map
. Sometimes the reads can often still be mapped as single-ended reads, although at the expense of mapping rate. The alignment algorithm can also be swapped out for a faster (but less accurate) algorithm with--xdrop-alignment
These days you might have better luck with the
vg giraffe
mapping tool instead ofvg map
.vg giraffe
tends to be much faster (closer to the speed ofbwa mem
). However, some of the indexes invg giraffe
can be difficult to build on very complicated graphs.