I have searched Biostars (and Googled like crazy), and for some reason I cannot find a complete list of the file formats used in bioinformatics. There are some lists, like http://bioinf.comav.upv.es/courses/sequence_analysis/sequence_file_formats.html or http://www.molecularevolution.org/resources/fileformats, but they are rather incomplete or too narrow.
I was expecting that someone had compiled a file format database, but I was very disappointed. Do you know of any more complete lists?
Thanks
A new program = a new format :-)
That's for a reason. Existing file formats are ridiculous! Come on, 'FASTA'? Is there a bigger mistake than this format? Then they replaced it with a 'much better' format: FASTQ. So now they store (large) BINARY data in a plain text file! No wonder there are so many FASTQ 'formats'. I don't know why bioinformaticians are so afraid of binary files! In the time wasted scanning a single line of text in a FASTQ file to find its true end (LF, CRLF, etc.), a program could process over 100 entries in a binary file.
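To make the comparison concrete, here is a minimal Python sketch, not a benchmark and not any real tool's format. It assumes strict four-line FASTQ records with Phred+33 qualities, and the length-prefixed binary layout (`pack_record` / `unpack_record`) is purely hypothetical, invented for illustration. The text reader has to find and strip each line ending, while the binary reader knows every field's length up front.

```python
import io
import struct

# Hypothetical binary header: name length (uint16) + read length (uint32),
# little-endian, no padding.
HEADER = "<HI"


def read_fastq(handle):
    """Yield (name, sequence, quality) from a binary file-like object.

    Assumes strict four-line records; tolerates LF and CRLF line endings.
    """
    while True:
        header = handle.readline()
        if not header:
            return
        seq = handle.readline()
        handle.readline()  # '+' separator line, ignored
        qual = handle.readline()
        # Every line must be scanned for its terminator and then stripped.
        yield (
            header.rstrip(b"\r\n")[1:].decode("ascii"),
            seq.rstrip(b"\r\n").decode("ascii"),
            qual.rstrip(b"\r\n").decode("ascii"),
        )


def pack_record(name, seq, qual):
    """Pack one read into the hypothetical length-prefixed layout."""
    name_bytes = name.encode("ascii")
    phred = bytes(ord(c) - 33 for c in qual)  # assumes Phred+33 encoding
    return (
        struct.pack(HEADER, len(name_bytes), len(seq))
        + name_bytes
        + seq.encode("ascii")
        + phred
    )


def unpack_record(buf, offset=0):
    """Return ((name, seq, qual), next_offset) without scanning for newlines."""
    name_len, read_len = struct.unpack_from(HEADER, buf, offset)
    offset += struct.calcsize(HEADER)
    name = buf[offset:offset + name_len].decode("ascii")
    offset += name_len
    seq = buf[offset:offset + read_len].decode("ascii")
    offset += read_len
    qual = "".join(chr(q + 33) for q in buf[offset:offset + read_len])
    offset += read_len
    return (name, seq, qual), offset


if __name__ == "__main__":
    # One CRLF-terminated record and one LF-terminated record in the same file.
    text = b"@r1\r\nACGT\r\n+\r\nIIII\r\n@r2\nGG\n+\nII\n"
    blob = b"".join(pack_record(*rec) for rec in read_fastq(io.BytesIO(text)))
    record, next_offset = unpack_record(blob)
    print(record, next_offset)
```

Whether the binary route is really 100 times faster depends on the implementation and the I/O; this sketch only shows where the newline scanning goes away.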
:)
Also: http://bioinformatics.roslin.ac.uk/lawslaws/ "The first step in developing a new genetic analysis algorithm is to decide how to make the input data file format different from all pre-existing analysis data file formats."