Question

Formula For Converting Sequence To Physical Coverage

0

Entering edit mode

11.7 years ago

Dan ▴ 540

If I have reads of length x, inserrt size of y and genome size z, how do I convert from sequence to physical coverage?

sequencing • 5.8k views

ADD COMMENT • link updated 11.7 years ago by Istvan Albert 102k • written 11.7 years ago by Dan ▴ 540

2

Entering edit mode

maybe you miss n the number of reads?

ADD REPLY • link 11.7 years ago by Ido Tamir 5.2k

0

Entering edit mode

n is implied by the coverage

ADD REPLY • link 11.7 years ago by Dan ▴ 540

score 1 · Answer 1 · 2013-11-15

1

Entering edit mode

11.7 years ago

Istvan Albert 102k

N = number of reads, L=length of each read, G=genome size

The "normal" definition of coverage would be C = N * L / G

Now if you want to extrapolate from that you could use the insert size for L but then you would need to also divide N by 2 as two reads form a pair. This would be more of a fragment coverage rather than actual base coverage.

ADD COMMENT • link 11.7 years ago by Istvan Albert 102k

0

Entering edit mode

Yes, that's what I call physical coverage. So I need to write two equations and solve to get the ratio? Is the ratio directly proportional to L1/L2?

ADD REPLY • link 11.7 years ago by Dan ▴ 540

0

Entering edit mode

instead of read length use fragment length, though it is not clear from your description what you mean by insert size? how do you know that? If it is by alignment then note that it could be wrong, hence extrapolating from that to physical coverage may also be incorrect

ADD REPLY • link 11.7 years ago by Istvan Albert 102k