After parsing a GenBank file and BLAST output (XML file) with the Biopython library, how can I calculate the number of linked gene clusters at different stringency levels, from 0 to 100 in increments of 10? Example:
0 (stringency level), 13456 (number of gene clusters)
10, 234
20, 234
30, 200
40, 190
50, 187
60, 187
70, 100
80, 95
90, 55
100, 45
BLAST is not a clustering tool "sensu stricto" but a pairwise aligner, so how do you define a cluster in your case? Something generated with OrthoMCL or BLASTClust?
I define a cluster as a group of homologous genes.
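One possible way to get such counts, as a rough sketch rather than a definitive method: treat the stringency level as a minimum percent identity, link two genes whenever a BLAST hit between them meets that minimum, and count the resulting connected groups (single-linkage clustering). The thread does not define "stringency", so the percent-identity interpretation is an assumption, as is counting only groups with at least two genes as "linked" clusters. The function names and the toy data are hypothetical. With Biopython, the (query, subject, percent identity) tuples could be built from the HSPs returned by `Bio.Blast.NCBIXML.parse`, for example as `100 * hsp.identities / hsp.align_length`.

```python
def count_clusters(genes, hits, threshold, min_size=2):
    """Count single-linkage clusters of genes at one stringency level.

    genes: iterable of gene identifiers.
    hits: iterable of (query, subject, pct_identity) tuples.
    threshold: minimum percent identity for a hit to link two genes (assumed
        meaning of "stringency level").
    min_size: smallest group that counts as a cluster (assumption: 2, so
        unlinked singleton genes are ignored).
    """
    genes = list(genes)
    parent = {g: g for g in genes}  # union-find forest, one tree per cluster

    def find(x):
        # Walk up to the root, halving the path as we go.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for query, subject, pct_identity in hits:
        if query != subject and pct_identity >= threshold:
            root_q, root_s = find(query), find(subject)
            if root_q != root_s:
                parent[root_q] = root_s  # merge the two clusters

    # Tally cluster sizes by root, then keep those that are large enough.
    sizes = {}
    for g in genes:
        root = find(g)
        sizes[root] = sizes.get(root, 0) + 1
    return sum(1 for size in sizes.values() if size >= min_size)


def cluster_counts(genes, hits, levels=range(0, 101, 10), min_size=2):
    """Return [(level, number_of_clusters), ...] for each stringency level."""
    genes, hits = list(genes), list(hits)
    return [(lvl, count_clusters(genes, hits, lvl, min_size)) for lvl in levels]


# Toy data (made up for illustration, not real BLAST output).
toy_genes = ["a", "b", "c", "d", "e"]
toy_hits = [("a", "b", 95.0), ("b", "c", 60.0), ("d", "e", 30.0)]

for level, n in cluster_counts(toy_genes, toy_hits):
    print(f"{level},{n}")
```

Single linkage is only one choice here: it chains genes together through intermediate hits, so tools such as OrthoMCL, which use a different clustering procedure, may give different groups. Swapping percent identity for another measure (bit score, e-value, alignment coverage) only changes how the third element of each tuple is computed and compared.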