Hi everyone, This is my first post on this forum, so I wanted to welcome and greet the whole biostar community. Currently in my work I am dealing with the synthesis of eps by soil bacteria. I wanted to compare gene regions covering eps synthesis genes for the whole genus but I don't quite know where to start. Should I download all available genomes for a particular genus or only the reference ones? Do you build a local database for BLAST or is it not necessary ? Knowing that individual genes may or may not be present in particular species, how to determine the gene range to be compared ? And finally, what software would you recommend for the comparison itself. I know Mauve and EasyFig. Or maybe something else ? I will be extremely grateful for all the answers !
I managed to move on. I downloaded the reference genomes using ncbi-genome-download and loaded them into mauve. But I have a problem with the gbff files. Individual replicons (chromosome, plasmids) with annotation are loaded to mauve as separate sequences. Do you know any way to combine these sequences into one without losing the annotation ?
Hi I know that Mauve is very specific in terms of input files. It only recognizes .fna and .gbk. Try uploading .gbk files. Make sure you have Genbank full files
Hi, thanks for your response. I managed to merge particular replicons into single .gbk file and perform alignment for a few genomes but doing that for let's say 50 genomes seems to be extremely time-consuming.