Entering edit mode
6.3 years ago
mitchez
▴
20
Hallo, I am interested in the algorithmic details about the BLAST tool. So far, I understood that BLAST processes the query sequence and stores words with high scores to search them in the database.
My question is : What kind of data structure is used by BLAST to preprocess the database sequences ? Also a hash-table ? How can BLAST quickly find occurences of seeds in database sequences ?
Is makeblastdb building this data structure ?
Thanks !
Source code for
blast+
is available here, if you are a programmer and wanted to take a look yourself.