Entering edit mode
3.4 years ago
pragnapcu
▴
10
Can someone please tell me from where can I get the WES or WGS dataset of size <1GB
Can someone please tell me from where can I get the WES or WGS dataset of size <1GB
Just browse sra-explorer.info for datasets. I doubt you can meaningfully query for file size as this depends on coverage and read length. Just download any from the species you need and then downsample. Just use head
on the fastq files to get like the first 4000000 lines which equals 1mio reads. Reads in fastq files are random and not ordered other than its position on the flow cell, hence you do not strictly need a dedicated sampling approach, especially if this is just for some practicing task.
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
what do you mean WES dataset?
whole-exome sequencing data or whole-genome sequencing data. Please correct me if I am wrong. I have started learning NGS analysis just now. I sincerely appreciate ur suggestions.
You can always downsample a larger dataset.
seqtk
,reformat.sh
from BBMap are a couple of possible tools.yeap but it can be that you need FASTQ files < 1GB, or BAM files < 1GB (which is pretty unrealistic), or VCF files < 1GB. All such files are possibly to be generated from WES