Entering edit mode
6.4 years ago
shuksi1984
▴
60
Can I get dummy fastq paired end human genomic data for DNA sequencing? Let me know any available source. The fastq file size is expected no be more than 1G each.
Did the answers in your previous question not help: Substitute of human whole genome fastq data.
Fastq file that I received was ~2GB each. But I want file size to be in MBs or KBs just for the sake to testing the tools in my application. Also, the fastq file should give some rsids and indels after passing through the pipeline. I used the SP1.fq from the given link enter link description here but this could not give any rsids after analysis.
You'd be best off following the advice from Sej Modha and using simulated reads then. Otherwise you'd need to start with a full dataset and filter out reads aligning to a couple example regions matching your needs.
You can use ART for simulating reads from a genome or you can download actual dataset from SRA or ENA databases.