SNP phylogenetics: VCF to tree
7.7 years ago
nepgorkhey ▴ 130

Hi All,

I called SNPs in about 60 bacterial genomes with GATK, using the same reference genome throughout, and I currently have a separate VCF file of SNPs for each genome. My main goal is to build a maximum-likelihood phylogenetic tree from these SNPs, but I am stuck on how to proceed. As I understand it, the VCF files first have to be merged (which I have done), but I am not sure how to feed the merged file to phylogenetics software to build the tree.

I was thinking of parsing the VCF files to extract the reference and alternate nucleotides and generate a FASTA sequence for each genome that can be used for phylogenetic analysis. Is that a correct approach, and are there scripts or software already available to do this?
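To make the idea concrete, here is a minimal sketch of that parsing step in plain Python. It is only an illustration under stated assumptions, not an established tool: it assumes a merged, tab-delimited multi-sample VCF with a GT field, treats calls as haploid by taking the first allele of each genotype, skips indels and other non-SNP sites, and writes N for missing calls. The function names `vcf_to_snp_sequences` and `to_fasta` and the sample names `iso1` to `iso3` are made up for the example. One caveat: after merging per-sample VCFs, a missing genotype may mean either "reference allele" or "no coverage", and this sketch does not try to tell those apart.

```python
def vcf_to_snp_sequences(vcf_lines):
    """Return {sample: concatenated SNP bases} from multi-sample VCF lines."""
    samples, seqs = [], {}
    for line in vcf_lines:
        line = line.rstrip("\n")
        if not line or line.startswith("##"):
            continue  # skip blank lines and meta-information lines
        fields = line.split("\t")
        if line.startswith("#CHROM"):
            samples = fields[9:]  # sample columns start at the 10th field
            seqs = {s: [] for s in samples}
            continue
        alleles = [fields[3]] + fields[4].split(",")  # REF followed by ALT(s)
        # Keep only sites where every allele is a single A/C/G/T base.
        if any(len(a) != 1 or a not in "ACGT" for a in alleles):
            continue
        gt_idx = fields[8].split(":").index("GT")  # assumes GT is present
        for sample, call in zip(samples, fields[9:]):
            gt = call.split(":")[gt_idx]
            # Assumption: haploid organism, so use the first allele only.
            first = gt.replace("|", "/").split("/")[0]
            seqs[sample].append(alleles[int(first)] if first.isdigit() else "N")
    return {s: "".join(bases) for s, bases in seqs.items()}


def to_fasta(seqs):
    """Format {name: sequence} as FASTA text, one record per sample."""
    return "".join(f">{name}\n{seq}\n" for name, seq in seqs.items())


# Tiny made-up example: two SNP sites and one indel (which is skipped).
example = """##fileformat=VCFv4.2
#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tiso1\tiso2\tiso3
chr\t10\t.\tA\tG\t.\tPASS\t.\tGT\t0\t1\t./.
chr\t20\t.\tC\tT,G\t.\tPASS\t.\tGT:DP\t2:9\t0:8\t1:7
chr\t30\t.\tAT\tA\t.\tPASS\t.\tGT\t1\t0\t0
"""
seqs = vcf_to_snp_sequences(example.splitlines())
print(to_fasta(seqs), end="")
```

The resulting FASTA contains only variable sites, so if it is passed to a maximum-likelihood program it may be worth checking whether that program offers a correction for SNP-only (ascertainment-biased) alignments.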

Because I ran the GATK SNP workflow separately for each genome, I am also wondering whether I should instead run it on all the genomes together in a single run.

Looking forward to your suggestions.

2.5 years ago

For anyone who reads this post in the future: see the thread "Multi-Sample Vcf To Phylogenetic Tree" for an answer. Hope it helps!
