I am going to start the whole genome study and I like to have an estimation about the storage which is used for analyzing, I know it depends on several things like coverage, single end or paired-end and so on, but if anyone can help me and give an estimation it would be great!
I like to have WGS from 100 individuals with 50X coverage and paired-end.
It may be helpful to look through your own datasets to see what you have. Extrapolate from those numbers and add 10% to account for variation.
mentioning which genome/species you're working on might be the crucial bit of info here.
for viral genomes a few GBs will do, for conifer genomes for instance go think the in the range of several TBs
I am working on the Human Genome.