Hi, given a few different Illumina raw read sets in fastq format or whatever format:
How would you most efficiently determine the genetic distance between them?
I am looking for a method that might be expanded on, given an increasing number of genomes. In other words, I would like to calculate the distances between the first few genomes; with the next Illumina run, I would want to calculate distances within the new genomes and between the new and last runs, but I would not want to re-calculate the distances within the first set of genomes.
I thought that maybe making assemblies would be the most efficient over time in combination with Mummer, but I am wondering if there is an assembly-free way to do it too.
Thanks! These are really great ideas. I think I can do this in jellyfish...