Hi!
I'm working with tuberculosis data, we have SNPs, MIRUs, RFLP and Spoligotype data.
I would like to do a supertree (phylogenetic approach in which many overlapping trees are combined to produce a single tree) combining these four methods.
I've read some papers and I've notice the Matrix Representation with Parsimony is very used.
I have about 1500 samples for 90 SNPs. I've done an individual tree for SNPs with Biopython and Networkx.
answering Leszek comment:
the output of MIRU (Mycobacterial Interspersed Repetitive Units) are numbers (1 to 25)
the output of Spoligotype (0 and 1 - presence and absence)
Does anyone has done something like that?
Or has any direction?
Any help is welcome.
Thanks :)
How many samples do you have? And how many SNPs per sample?