Hi all, i have been given 50 E.coli genome fastq files. But I don't have so much idea about background of sequencing. The information I have is that they're illumina reads with average length 250bp. Now i want to check coverage of all the genome. I did like this:
coverage = (read count * read length ) / total genome size.
where read count =(wc-l xyz.fastq)/4
read length =250
total genome 5.2million bp
Can any one please suggest me i am doing write way or my calculation wrong??
http://genescripts.com/calculating-genome-coverage.php http://davetang.org/muse/2015/08/05/creating-a-coverage-plot-using-bedtools-and-r/
Please how can i determine the coverage using software