If I have reads of length x, inserrt size of y and genome size z, how do I convert from sequence to physical coverage?
If I have reads of length x, inserrt size of y and genome size z, how do I convert from sequence to physical coverage?
N = number of reads, L=length of each read, G=genome size
The "normal" definition of coverage would be C = N * L / G
Now if you want to extrapolate from that you could use the insert size for L
but then you would need to also divide N
by 2 as two reads form a pair. This would be more of a fragment coverage rather than actual base coverage.
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
maybe you miss n the number of reads?
n is implied by the coverage