Hi all,
I try to identify the ortholog of a particular gene across genomes of many species. However, most of the genomes I am working with are unannotated.
What tools or workflows would you recommend for finding orthologs in unannotated genomes?
From my research, tools like OrthoMCL seem to require annotated genes or, at the very least, an identification of coding regions. Given that I don't have this information, I am unsure how to proceed.
I was considering using tools like MiniMap2 to align the gene directly to the genomes, as it supports long-read alignments. Would this approach work? I assume this approach would tell me whether this gene is present or not in the genome?
Could anyone provide guidance or suggest alternative tools/methods that might work in this scenario?
Thank you!
Thank you for the suggestion!
I have tried it on more than 300 genomes now. Only several genomes got positive result. It seems hard to judge whether is false negative (because I use the default settings for all species), or it is really missing from a particular species.