Hi, I'm running vg autoindex --workflow giraffe
to try to create a graph reference using a VCF of variants I have that I later want to align some reads to with giraffe
. I tested it out with a few variants on one chromosome, and this took a very long time with a lot of memory. I was wondering if it was possible to parallelize constructing a full graph reference by chromosome and merging the results into one reference at the end.
I tried looking through vg combine --help
but it seems to only support combining .vg
files rather than the .gbz
, .dist
, and .min
files produced using the vg autoindex
command.
What's the best way to scale up creating a graph reference from a VCF using the tool? Would combine
still work, or is there another subtool to help here?
Thanks!
Thanks, I didn't realize
autoindex
was already running in parallel. I increased the memory for my task and it was able to complete.