Hi, I intend to run a relatively large-scale genotype-to-phenotype assessment (hundreds of genomes. dozens of phenotypes) and would like to use kover to provide targets for the bases of the phenotypes. My problem is that the dataset covers different species, rather than strains of the same species, so the sequences are much divergent, and so I don't think that running kover the 'default' way would work too well. To overcome that, I was thinking about using the amino acid sequences of proteins identified in each species, but I am not sure if kover would accept that as a valid input to the '--from-contigs' option? Alternatively, would it work fine if I passed the k-mer matrix for amino-acid sequences to '--from-tsv', rather than nucleotides? Thanks in advance for all your help!
Thank you, that helps and clarifies a lot!