Find related samples based on a list of variants
1
1
Entering edit mode
9.3 years ago
igor 13k

Is there a tool that can take a list of mutations (in a VCF file or just a list of positions) from multiple samples and then cluster the samples based on how closely related they are?

This may not be very complex, but as far as I know, it would involve a few steps and I don't want to reinvent the wheel, especially if I may not do in the most appropriate way. I assume someone has already automated the process. Maybe even with really nice graphs as an output.

vcf variants • 1.4k views
ADD COMMENT
0
Entering edit mode

"Yes, this is not very complex" , AKA "I have discovered a truly marvellous proof of this, which this margin is too narrow to contain." :-)

ADD REPLY
0
Entering edit mode

I guess I meant "this may not be very complex". The reason for my assumption is there does not seem to be an available solution, which usually means it's either trivial or extremely difficult (which is definitely not the case here).

ADD REPLY
1
Entering edit mode
9.3 years ago

Sounds like a phylogenetic tree would work. SNPhylo (reference here) can take VCFs as input and create distance tree graphs.

ADD COMMENT
0
Entering edit mode

Sounds very promising. It looks like it needs a single multi-sample VCF, though. Or can it take multiple single-sample VCFs?

ADD REPLY

Login before adding your answer.

Traffic: 1967 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6