Where Can I Get Some Illumina Casava Input Files?
1
0
Entering edit mode
12.6 years ago
Misha ▴ 20

I am a software developer who is investigating software optimization techniques. I got maximal marks in high-school biology and moved onto period 3 elements.

I was told that CASAVA produces enormous output files, and I am interesting in procuring some kind of file that will produce an enormous output file. I want terabytes or a few hundred gigabytes, at the least. The validation examples don't do this.

Are there some publicly available files that can do this? Are there some genomes online?

==Edits==

  1. I want to profile certain Disk I/O performance characteristics when writing large files. I have a harddrive with lots of wires and a some fancy equipment.
  2. I understand that Casava takes in BCL files and spits out FASTQ. I want it to spit out a large FASTQ file. SO I guess I need the BCL files? Is there a public repository?
• 3.0k views
ADD COMMENT
1
Entering edit mode

What is the input that your algorithm needs? Are you looking for FASTQ files, or something else?

ADD REPLY
0
Entering edit mode

Can you fill us in on what you are trying to accomplish?

ADD REPLY
0
Entering edit mode

If you are only doing read/write benchmarks generate the data on the fly with a small program and write it out. Thats easier than download in correct sizes or trim it to the correct sizes.

ADD REPLY
1
Entering edit mode
12.6 years ago

Thanks for editing your post.

Option 1 generate fake fastq files to you specification. This was covered here:

biostar - What NGS read simulators are available for paired-end data?

Option 2 grab some fastq data from Short Read Archive:

SRA

Option 3 find someone who has casava files.

Happy benchmarking.

ADD COMMENT
0
Entering edit mode

Doesn't CASAVA output FASTQ? I'm interested in benchmarking CASAVA itself, so I need fake BCL files..?

ADD REPLY

Login before adding your answer.

Traffic: 1375 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6