Hello,
I am interested in studying the co-occurrence of biosynthetic gene clusters (BGCs) across Streptomyces species. My general plan is to run antiSMASH on Streptomyces genome data and then analyse the results with BiG-SCAPE, which reports a Domain Sequence Similarity (DSS) index and a Jaccard index as output.
I would then take these values and try to quantify relationships between BGCs, possibly through Bayesian statistics, partial correlations, or even a machine learning approach, but I am not sure which of these is appropriate. I am fairly new to this area of bioinformatics, so any input is welcome.
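To make the idea concrete, here is a rough sketch of the kind of calculation I have in mind. It assumes the BGCs have already been grouped into gene cluster families (GCFs) and that each genome is reduced to the set of GCFs it contains. The strain names, GCF labels, and both function names are made up for illustration and are not real BiG-SCAPE output. For each pair of families it counts the shared genomes, computes a Jaccard index over genomes, and gives a one-sided Fisher-style p-value for the overlap.

```python
from itertools import combinations
from math import comb

# Hypothetical toy data: which GCFs are present in which genome.
# In practice this would have to be derived from the clustering results.
genome_to_gcfs = {
    "strain_A": {"GCF_1", "GCF_2", "GCF_3"},
    "strain_B": {"GCF_1", "GCF_2"},
    "strain_C": {"GCF_1", "GCF_2", "GCF_4"},
    "strain_D": {"GCF_3", "GCF_4"},
    "strain_E": {"GCF_4"},
    "strain_F": {"GCF_1", "GCF_2"},
}


def hypergeom_upper_tail(k, n, n_a, n_b):
    """P(overlap >= k) if n_a and n_b genomes were picked independently
    at random out of n (one-sided Fisher's exact test)."""
    total = comb(n, n_b)
    hits = sum(
        comb(n_a, i) * comb(n - n_a, n_b - i)
        for i in range(k, min(n_a, n_b) + 1)
    )
    return hits / total


def cooccurrence_table(genome_to_gcfs):
    """Return (gcf_a, gcf_b, shared_genomes, jaccard, p_value) per GCF pair."""
    genomes = sorted(genome_to_gcfs)
    families = sorted(set().union(*genome_to_gcfs.values()))
    # Set of genomes carrying each family.
    carriers = {
        f: {g for g in genomes if f in genome_to_gcfs[g]} for f in families
    }
    rows = []
    for a, b in combinations(families, 2):
        shared = len(carriers[a] & carriers[b])
        either = len(carriers[a] | carriers[b])
        jaccard = shared / either
        p = hypergeom_upper_tail(
            shared, len(genomes), len(carriers[a]), len(carriers[b])
        )
        rows.append((a, b, shared, jaccard, p))
    return rows


for a, b, shared, jaccard, p in cooccurrence_table(genome_to_gcfs):
    print(f"{a}\t{b}\tshared={shared}\tjaccard={jaccard:.2f}\tp={p:.3f}")
```

I realise this treats every genome as an independent sample, which is probably not true for closely related strains, so I assume the p-values would be too optimistic without some correction for phylogeny and for multiple testing.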
Thanks so much.