I'm comparing NGS data from whole genome sequencing. The genomes come from bacteria which caused an outbreak and one or more "normal" strains from the same species. The department I work for would like to see if there are regions of DNA that differ. However, finding SNPs and indels are not useful to our research because they are too small. I'm looking for smaller sequences (like genes), that are only present in the outbreak and absent from the other genomes or the other way around.
My problem is this. I cannot find any suitable tools or algorithms that can be used to find this regions. I was thinking about using BLAST but I wonder if there are tools available for this purpose. Maybe it's because I'm a intern and not fully aware of the many, many tools out there and the correct terminology (yet!).
I assessed tools like breseq, genomeVISTA and Harvest. All these tools focus on small mutations like indels and SNPs, and therefore no good fit.
Can you help me out? Can you make some suggestions or give some names of tools I can look into? My thanks in advance.
Have you tried mauve?