Very quick question:
I am using this method to calculate coverage:
Coverage = N_reads * (average_read_length / haploid_genome_size)
when calculating physical coverage for paired-ends, is "average_read_length" the average of the DNA fragment lengths (length of read1 + insert length + length of read2) or is it the length of the reads (read1+read2)
I think it's the fragment length/template length, right?
Does "N_reads" represents the number of reads or the number of fragments/spots that were sequenced?
You would not know the length of the fragment. It would be plain
average_read_length
in terms of bases. This would be a rough estimate ofcoverage
for the genome based on the number of bases you sequenced and the size of the genome. The formula may need to beCoverage = (N_reads * average_read_length) / haploid_genome_size