Hi All,
I have some hundreds of pair end illumina fastq files with really low coverage 0.5x (Skim Seq). I am wondering if discoSNP could make a decent work calling SNPs from these samples. Or which software you recommend me?
Would be good to be able to call SNPs using even a single read. Because we have several individuals from the same population we could then filter the SNPs in terms of mayor and minor frequencies to keep only the most reliable SNPs (present in higher frequencies in the population, of course may be losing some real low frequency SNPs)
Also to avoid SNPs comming from paralogous sequences, I could filter the reads from mapping files and be more estrict in terms of percent of identity and percent of the lenght of the alignead read. Also, becasue I am working with complex genomes (maize) may be would be a good idea to determine the mappabilty of the genome to keep only regions with "good" mappability.
Now I am wondering which softwore would me allow to call SNPs even from a single read, and with no so many filters, at least at ethe beggining for a initial SNP calling.
Best Wishes,
Eric