What is the node_id in vg giraffe output (gaf or gam)?
18 months ago

Hi,

I am playing with a genome graph I built from whole genome sequencing data of 2 different individuals. I also generated 1 million simulated reads. However, when I mapped the simulated reads back to the genome graph, the node_id in both the gaf and gam output did not match the original segment_id in my gfa file. I checked quite a few reads and all of them behave this way.

My question is, is this intended? Any suggestions would be greatly appreciated. Thanks!

giraffe vg • 934 views

The node IDs in vg don't necessarily correspond to the segment IDs in an input GFA, because vg will chop long nodes into shorter ones for computational reasons. If you want the alignments from vg giraffe to use the original segment IDs, you can run vg autoindex on the GFA to get a segment file (along with the other indexes), and then supply it to vg giraffe with --named-coordinates.
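
To illustrate why chopping changes the IDs, here is a minimal Python sketch. It is not vg's actual code or ID assignment scheme: the function names, the toy segments, and the max_len value are all assumptions made up for the example. It only shows the general idea that splitting long segments yields new node IDs, and that a translation table is needed to get back to segment names and offsets.

```python
from typing import Dict, List, Tuple

# Each row: (new_node_id, segment_name, offset_in_segment, sequence)
Row = Tuple[int, str, int, str]


def chop_segments(segments: Dict[str, str], max_len: int) -> List[Row]:
    """Split every segment into pieces of at most max_len bases.

    Hypothetical illustration only: new node IDs are simply assigned
    1, 2, 3, ... in the order the pieces are produced.
    """
    rows: List[Row] = []
    next_id = 1
    for name, seq in segments.items():
        for offset in range(0, len(seq), max_len):
            rows.append((next_id, name, offset, seq[offset:offset + max_len]))
            next_id += 1
    return rows


def to_segment_coords(rows: List[Row], node_id: int, node_offset: int) -> Tuple[str, int]:
    """Translate a (chopped node ID, offset) pair back to (segment name, offset)."""
    for nid, name, seg_offset, _seq in rows:
        if nid == node_id:
            return name, seg_offset + node_offset
    raise KeyError(node_id)


# Toy GFA-like input: segment "1" is longer than max_len, segment "2" is not.
segments = {"1": "ACGTACGTAC", "2": "GGT"}
rows = chop_segments(segments, max_len=4)

for row in rows:
    print(row)
```

In this toy case segment "1" ends up as nodes 1 to 3, so segment "2" becomes node 4, and an alignment reported on node 4 no longer carries the name it had in the GFA. A lookup such as to_segment_coords(rows, 3, 1) maps position 1 of chopped node 3 back to segment "1" at offset 9.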

Thank you so much. Your solution works!
