2.6 years ago
genomes_and_MGEs
Hi everyone,
I have a very large numeric matrix with the following structure:
1 3128 3e-06
1 14194 5.2e-05
1 14203 5.3e-05
1 2730 6.3e-05
1 2700 0.000156
Columns 1 and 2 are genomes, identified by numbers, and column 3 is the genetic distance between them. Loading the network into Cytoscape is impossible because the text file is too big (~200 MB). Is it possible to filter the network so that only clusters containing at least 20 genomes linked together are kept in the numeric matrix?
Thanks!
In this example, the rows give the genetic distance between genome 1 (column 1) and other genomes (3128, 14194, etc.). Is it possible to filter the network so that only clusters containing at least 20 genomes linked together (i.e., all the genomes share a genetic distance with one another) are kept in the numeric matrix? For example, if genome 2 is only linked to genome 45, then that line should be removed. But if genome 3 is linked to 50 genomes, and these genomes are also linked to each other, then those lines should be kept. Can someone help me with this?
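To make the kind of filtering I mean concrete, here is a minimal Python sketch. It assumes the file is whitespace-separated with one pair per line, and it treats a "cluster" as a connected component (genomes reachable from one another through any chain of listed pairs). That is looser than requiring every genome to be linked directly to every other one, which would be a clique search and is not what this does. The function names and the file names in the comments are hypothetical.

```python
from collections import Counter


def find(parent, x):
    """Return the representative of x's component (union-find with path halving)."""
    while parent[x] != x:
        parent[x] = parent[parent[x]]
        x = parent[x]
    return x


def build_components(lines):
    """First pass: merge the two genomes on each line into one component."""
    parent = {}
    for line in lines:
        fields = line.split()
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        a, b = fields[0], fields[1]
        parent.setdefault(a, a)
        parent.setdefault(b, b)
        root_a, root_b = find(parent, a), find(parent, b)
        if root_a != root_b:
            parent[root_a] = root_b
    # Number of genomes in each component, keyed by component representative.
    sizes = Counter(find(parent, node) for node in parent)
    return parent, sizes


def filter_lines(lines, parent, sizes, min_size=20):
    """Second pass: yield only lines whose genomes sit in a large enough component."""
    for line in lines:
        fields = line.split()
        if len(fields) < 3:
            continue
        if sizes[find(parent, fields[0])] >= min_size:
            yield line


# Hypothetical usage on a file named distances.txt (read twice to avoid
# holding every line in memory):
#
# with open("distances.txt") as f:
#     parent, sizes = build_components(f)
# with open("distances.txt") as f, open("filtered.txt", "w") as out:
#     out.writelines(filter_lines(f, parent, sizes, min_size=20))
```

Only the genome IDs are kept in memory, not the lines themselves, so a ~200 MB file should be manageable. Note that every listed pair counts as a link here regardless of the distance in column 3; applying a distance cutoff before merging would be a separate decision.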