Hi all,
I would like to make a Venn diagram by pairwise comparison of orthogroups generated from 4 different strains. I have run Inparanoid and got a list of all paired orthogroups (e.g., SQLtable). However, I am having trouble making the Venn diagram because I cannot work out how to determine "Core", "Accessory", and "Singleton" groups from the Inparanoid output. If you have had a similar experience, could you please share your ideas with me?
Or, if you could show me how to transform the Inparanoid output table below, I may be able to go further by myself.
[Original]
1 tfs232
1 tfs235
2 tfs148
2 tfs146
3 tfs139
3 tfs178
3 tfs179
*The total table length is ~1000 rows and the number of rows per group ranges from 2 to 5, so I cannot just extract odd and even rows.
[This is what I want to make]
1 tfs232 tfs235
2 tfs148 tfs146
3 tfs139 tfs178 tfs179
Thanks.
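Here's how you can do it:
awk '{tass[$1]=tass[$1]"\t"$2}END{for(i in tass){print i tass[i]}}' your_file
Each group is printed on one line, with the group number followed by its tab-separated members. The order in which awk's for-in loop visits the groups is not guaranteed, so pipe the output through sort -n if you need them in numeric order. As for which groups are "Core", "Accessory", and "Singleton", that depends on your data.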
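If you also have a table that says which strain each gene comes from, the classification could be sketched along these lines. This is only a rough sketch under assumptions that are not in the original post: the gene-to-strain file (one "gene_id strain" pair per line) is hypothetical and is not something Inparanoid produces, the function name classify_groups is made up for illustration, and the definitions used here are assumed ones: "core" means a group has members from all N strains, "accessory" means any other group, and "singleton" means a gene that appears in no group at all. Adjust them to whatever definitions your analysis uses.

```shell
# classify_groups PAIRS GENE2STRAIN N_STRAINS
#   PAIRS:       "group_id gene_id" per line (the table from the question)
#   GENE2STRAIN: "gene_id strain" per line (hypothetical file, an assumption)
#   N_STRAINS:   total number of strains compared (4 in the question)
# Prints: id <TAB> number_of_strains <TAB> category
classify_groups() {
  awk -v n="$3" '
    NR == FNR { strain[$1] = $2; next }      # first file: gene -> strain
    !($2 in strain) { next }                 # skip genes missing from the map
    {
      grouped[$2] = 1                        # this gene belongs to some group
      if (!(($1, strain[$2]) in seen)) {     # count each strain once per group
        seen[$1, strain[$2]] = 1
        count[$1]++
      }
    }
    END {
      for (g in count)
        print g "\t" count[g] "\t" (count[g] == n ? "core" : "accessory")
      for (gene in strain)                   # genes never seen in any group
        if (!(gene in grouped)) print gene "\t1\tsingleton"
    }
  ' "$2" "$1" | LC_ALL=C sort
}
```

You would call it as classify_groups pairs.txt gene2strain.txt 4, where both file names are placeholders. Counting the lines per category then gives you the numbers to put in the Venn diagram regions, although a full 4-way Venn would need the count for every strain combination, not just these three classes.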