I'd like to build a parser to load the output from various sequencing technologies into a database. There are many different file formats. I am mostly familiar with FASTQ and FASTA/QUAL, produced by Illumina and 454, respectively. What raw sequence file formats do you work with?
You want to put some FASTQs (or whatever) in a database? Why? What kind of data do you need to index? What kind of information do you need to find quickly that isn't available through samtools or the Samtools API?
Forget about the database. I really just want to know what kind of raw sequence file formats are common.
The reason I wanted a database is that I need random access.
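If random access is the only requirement, a byte-offset index may be enough without a full database. Here is a minimal sketch in Python, assuming an uncompressed FASTQ file with strict 4-line records (no wrapped sequence lines) and unique read IDs. The function names `build_fastq_index` and `fetch_record` are made up for illustration; existing tools such as `samtools faidx` or Biopython's `SeqIO.index` may already cover this.

```python
def build_fastq_index(path):
    """Map each read ID to the byte offset of its '@' header line.

    Assumption: strict 4-line FASTQ records and unique read IDs.
    """
    index = {}
    with open(path, "rb") as handle:
        while True:
            offset = handle.tell()
            header = handle.readline()
            if not header:
                break  # end of file
            if not header.strip():
                continue  # tolerate blank lines between or after records
            # The read ID is the first whitespace-delimited token after '@'.
            read_id = header[1:].split()[0].decode("ascii")
            index[read_id] = offset
            # Skip the sequence, '+' separator, and quality lines.
            for _ in range(3):
                handle.readline()
    return index


def fetch_record(path, index, read_id):
    """Seek straight to one record and return its four lines as a tuple."""
    with open(path, "rb") as handle:
        handle.seek(index[read_id])
        lines = [handle.readline().decode("ascii").rstrip("\r\n") for _ in range(4)]
    return tuple(lines)
```

The index is a plain dict of read ID to offset, so it could be pickled or stored in a small key-value table if the files are too large to re-scan each time. This would not work as written on gzip-compressed files, where plain byte offsets are not seekable.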