Hey guys,
I been doing some OrthoMCL analyses (stand-alone version) with 4 different species. Now I would like to build a venn diagram showing the statistics of the comparison (e.g. numbers of clusters shared by all the species, number of clusters only shared by two species, and so forth). I haven't came across a way to split the "groups.txt" output file to get this numbers.
Any thoughts?
Thanks in advance,
J.
Does the groups.txt file follow the format of each ortholog group being listed at a single line? If yes, are your protein IDs species specific? If yes, then getting these statistics ought to be rather easy with e.g. awk
No, that's the problem, species are mixed through the file
And yes, my protein ID's are species specific