Entering edit mode
3.1 years ago
robert.murphy
▴
90
It is my understanding that k-mer coverage with regards to sequencing data is simply the distribution of k-mers across the raw sequencing reads and have nothing to do with coverage of the resulting assembly?
To that end, what is the normal upper limit of a k-mer coverage search and how to you pick what k-mer to search at? I assume the physical limit is simply k-mer = read length?
Lastly, what are the major benefits of doing this? I can understand why one would look at coverage of the actual assembly to identify any low coverage regions but what does a distribution of k-mers tell us about the raw data?