Hi there,
I'm completely new as-of-this-morning to ddRADseq, but am trying to get my head around the theory. If I have a 30 Mb genome and use ddRADseq with ~6500 digestion sites, how many segregating SNPs can I find (roughly)? WGS shows that there's roughly 20000 SNPs separating each isolate.
Does this depend on the amount of the reference genome covered by the ddRAD?
Any help is greatly appreciated - thanks.
6500*200 may be ? Because ddRad seq is generally PE sequencing.
Right, that's why I included the "If you sequence fragments that are 150-200 bp with 100 bp paired-end reads, you'll sample more of the genome and have more SNPs."
I thought it might be helpful to start with the simpler case of always being 100 bp to show how it is done. Sequencing a size range is less exact (150-200 bp fragments with PE) since it depends on the distribution of fragment sizes.
Ah, I see! That makes complete sense, thank you for going through that.
In reply to the 6500 digestion sites, I would actually have more because I'd use two cutters (as you mention). 6500 was just for the one cutter. But thank you for your reply, the theory makes sense now.