10.8 years ago
ewre
Hi all,
How can I get a gene phylogenetic profile matrix that contains presence/absence information for genes across a handful of species?
Hi Asaf, I have downloaded the file you mentioned above. It has 3 columns, but I can't figure out how to generate the matrix from it. Can you provide some clues on how to do this?
The first column is the taxonomy ID, the second is the protein group, and the third is the number of representatives of that group in the genome. You can convert it to matrix form fairly easily: put True in the cell for each taxonomy ID and protein family that appear together in a row, and False everywhere else. You will need to write a little code.
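For illustration, here is a minimal Python sketch of that conversion. It assumes the file is tab-delimited with no header line and that the third column is an integer count; none of that is confirmed in this thread, so adjust the delimiter as needed. The function name `build_profile_matrix` and the sample rows are made up for the example and are not real data.

```python
import csv
import io


def build_profile_matrix(lines, delimiter="\t"):
    """Build a presence/absence matrix from three-column rows.

    Each row is assumed to be: taxonomy ID, protein group, count.
    Returns (taxids, groups, matrix), where matrix[i][j] is True
    when groups[j] has at least one representative in taxids[i].
    """
    counts = {}
    for row in csv.reader(lines, delimiter=delimiter):
        if len(row) < 3:
            continue  # skip blank or incomplete lines
        taxid, group, n = row[0], row[1], int(row[2])
        # Sum counts in case the same pair appears on several lines.
        counts[(taxid, group)] = counts.get((taxid, group), 0) + n

    # Sorted as strings, so the row and column order is deterministic.
    taxids = sorted({t for t, _ in counts})
    groups = sorted({g for _, g in counts})

    # Pairs that never appear in the file default to False (absent).
    matrix = [[counts.get((t, g), 0) > 0 for g in groups] for t in taxids]
    return taxids, groups, matrix


# Invented sample rows, standing in for the downloaded file.
sample = io.StringIO(
    "562\tCOG0001\t2\n"
    "562\tCOG0002\t1\n"
    "1423\tCOG0001\t1\n"
)
taxids, groups, matrix = build_profile_matrix(sample)

# Print the matrix with group names as the header row.
print("\t".join(["taxid"] + groups))
for taxid, row in zip(taxids, matrix):
    print("\t".join([taxid] + [str(cell) for cell in row]))
```

To run it on the real download, pass an open file handle instead of the `io.StringIO` object. Keeping the summed counts rather than the True/False test would give a copy-number matrix instead of a presence/absence one.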
I got the idea. What exactly does the third column in this table mean?
It is the number of representatives of the COG group in the genome.