Hi. I'm trying to calculate my sequenced bacterial genome's genome coverage (for NCBI submission) but I'm not sure if I got the right answer. According to biostars.org/p/208339 , the formula is C = R X L / G where C is the coverage, R is the total number of reads, L is read length, and G is the genome size.
The total reads (R) that I got for my genome is around 31,000,000 while the read length (L) is 100 according to the sequence length I retrieved from FastQC when I submitted one paired-end read of my genome. The genome size (G) I got after assembly is around 4,000,000 bp.
If I use the formula, the answer would be 1,550 = ((2100) 31,000,000) / 4,000,000. I think I may have the wrong answer as I noticed from other whole genome sequencing studies that their genome coverage is only around 100x-300x or it even goes as low as 12x. Did I really have the wrong answer? Thank you in advance.
while a theoretical formula is quite nice, you should (IMO) plot / look at actual coverage to confirm. map your reads to a reference, and use e.g. mosdepth to calculate coverage or just open bam file in genome browser to see coverage