Hi,
I am currently new to the field of bioinformatics. While searching for sequence alignment I came across some alignment algorithms like BWA, Velvet , MAQ , SOAP etc.
I am interested in doing performance comparison of these tools via profiling myself (not so deeply but somewhat getting a rough idea).
For this purpose, I require benchmarks of human unaligned DNA (preferably short sequences) in FASTA file format. I don't want sequences of a specific kind but general enough to compare relative average performances.
After lot of searching on Google,Bing,Duck duck go, I wasn't able to find anything. Could you tell me
- What kind (Reference DNA , short DNA etc.) of database is required for the job?
- How many of them would suffix for the job?
Take care that the average and general performance analysis needs to be done so databases have to be general(not confined to a particular job) and also they should not be similar(difference should be there for complete performance analysis).
P.S. Give somewhat greater emphasis to BWA , BWA-SW
Thanks
You need fastq most likely not fasta.