Hi all, I need to create simulated paired-end sequence data with fixed read-lengths on each end (e.g., 75mers on each end of a 500bp DNA fragment, a la Illumina). Does anyone know of a reliable simulator that can generate paired-end sequences to a requested depth, with a requested insert size/variance and error rate, for a requested genome in a FASTA file? The output would preferably be two FASTQ files, one for each end.
I can write my own, but do not want to re-invent this boring (though useful) wheel. Any clues?
See also the following thread discussing read simulation with quality scores: http://bit.ly/kNePbA