6.8 years ago
navela78
Is there a place where I can read about the technology stack used at NCBI, especially in the backend, for their major databases like Genbank, GEO, etc.? They are doing an amazing job and I would really like to learn how they do what they do :)
I don't recall seeing any information about the actual infrastructure (hardware/software) NCBI uses. Information Resources Branch (IRB) appears to be responsible for that aspect. Federal contractors probably do the actual management work.
In a way this is surprising. All of the data (with some exceptions) is publicly available, but there appears to be a paucity of information about the hardware/software that NCBI services run on.
Yes it is surprising!
It would be interesting to know more. This article (2009ish?) has a couple of comments from the NCBI IEB chief on SRA infrastructure (a SQL Server relational metadata database plus filesystem storage type of design).
Thanks for the article. Do you know anything about genbank?
Anyone from NCBI or a contractor for NCBI want to take a shot at this? Perhaps we can start with Genbank ... No need to go into details ... just a quick overview of technologies.
I found this article with some information about the technology behind how NCBI web BLAST queries are handled.
Looks good! They don't explain genbank etc. in such detail.