I am working on identifying bacteria/viruses in infectious material, and I have 2 x 100 bp paired-end reads. Let us say the bacterial genome size is 10 kb. What are the criteria for choosing the insert size? Should it be anywhere from 0 to 1000 bp, or should it be small, based on the read length or the total genome size? Is there a formula or rule of thumb to follow?
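
To make the question concrete, this is how I currently picture the relationship between insert size and read length. It is only a rough sketch: it assumes "insert size" means the full fragment length between the adapters (I understand some tools use the term for the gap between the two mates instead), and `inner_distance` is just a name I made up for illustration.

```python
def inner_distance(insert_size: int, read_length: int = 100) -> int:
    """Distance between the inner ends of the two mates, in bp.

    Assumes insert_size is the full fragment length and both mates
    have the same length. A negative value means the mates overlap.
    """
    return insert_size - 2 * read_length


# Try a few candidate insert sizes with 2 x 100 bp reads.
for insert_size in (150, 200, 500, 1000):
    gap = inner_distance(insert_size)
    if gap < 0:
        status = f"mates overlap by {-gap} bp"
    elif gap == 0:
        status = "mates abut with no gap"
    else:
        status = f"{gap} bp unsequenced gap between mates"
    print(f"insert {insert_size} bp: {status}")
```

So with 2 x 100 bp reads, anything below 200 bp gives overlapping mates and anything above leaves an unsequenced stretch in the middle. What I cannot work out is which of these is preferable for a small genome.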
Thanks