What Are The Public Sql Servers For Bioinformatics ?
9
22
Entering edit mode
14.7 years ago

Do you know any public scientific SQL server ?

For example, I would cite:

(I'll give a +1 to each correct answer)

online-server sql • 13k views
ADD COMMENT
10
Entering edit mode
13.4 years ago

1000 Genomes: since June 16, 2011: http://www.1000genomes.org/public-ensembl-mysql-instance

mysql -h mysql-db.1000genomes.org -u anonymous -P 4272
ADD COMMENT
6
Entering edit mode
13.4 years ago

Ensembl and EnsemblGenomes both have MySQL access to the main dbs and mart dbs:

mysql -hensembldb.ensembl.org -P5306 -uanonymous
mysql -hmysql.ebi.ac.uk -P4157 -uanonymous

MySQL DB (BioMart)

mysql -hensembldb.ensembl.org -P5316 -uanonymous

EnsemblGenomes

mysql -hmysql.ebi.ac.uk -P4157 -uanonymous
ADD COMMENT
1
Entering edit mode

I am afraid that is not completely correct ....

Ensembl up to version 47:

mysql -hensembldb.ensembl.org -P3306 -uanonymous

Ensembl up from version 48 onwards:

mysql -hensembldb.ensembl.org -P5306 -uanonymous

Ensembl BioMart up to version 47:

mysql -hmartdb.ensembl.org -P3316 -uanonymous

Ensembl BioMart from version 48 onwards:

mysql -hmartdb.ensembl.org -P5316 -uanonymous

See also: http://www.ensembl.org/info/data/mysql.html

ADD REPLY
5
Entering edit mode
14.7 years ago
Ian Simpson ▴ 960

Flybase has direct access to its postgres chado database.

http://flybase.org/forums/viewtopic.php?f=14&t=114

hostname: flybase.org
port: 5432
username: flybase
password: no password
database name: flybase

e.g.

psql -h flybase.org -U flybase flybase
ADD COMMENT
1
Entering edit mode

Very helpful sample queries for the FlyBase chado database can be found here:

http://gmod.org/wiki/FlyBase_Field_Mapping_Tables

ADD REPLY
3
Entering edit mode
14.7 years ago
Neilfws 49k

PublicHouse - uses the BioWarehouse system; requires user registration.

ADD COMMENT
3
Entering edit mode
14.7 years ago
Ian Simpson ▴ 960

This is quite an important one for people doing mouse work, though it is important to note that JAX offer Mart and Batch-Query functionality through the web site as well which may well suit many peoples needs.

Note that this is a public 'free' service, but that you do need to contact user support to get your login and password. They are also happy to provide some custom SQL scripts to get you started.

ADD COMMENT
3
Entering edit mode
12.9 years ago
Reece ▴ 310

http://unison-db.org/

Unison is a comprehensive set of protein sequences with lots of precomputed results of many types. Unison was designed primarily for rapid mining of complex combinations of protein features. There's a simple homology-based structure viewer that can map SNPs and arbitrary regions onto structures.

The Unison source and website were released under the Academic Free License, and the data are released under the Creative Commons. There's a public PostgreSQL service (Try 'psql -h unison-db.org -U PUBLIC -d unison'). Command line tools are available. Loading and computing are automated and incremental, and coordinated by a makefile.

See http://unison-db.org/tour/ and http://unison-db.org/doc/ for examples and connection information.

The site is currently hosted at Genentech, my former employer. Unison is used internally at Genentech, but not updated as often as I used to (but there's a page with versions of source databases).

-Reece

ADD COMMENT
2
Entering edit mode
13.2 years ago

Don't forget EnsEMBL's new US East MySQL mirror too:

mysql -h useastdb.ensembl.org -u anonymous -P 5306

See here http://www.ensembl.info/blog/2011/07/06/ensembl-now-has-a-second-public-mysql-server/

You can also run your own mirrors in the cloud http://www.ensembl.info/blog/2011/07/12/run-a-private-ensembl-mysql-in-the-cloud/ ;-)

ADD COMMENT
0
Entering edit mode
13.2 years ago
Sander Timmer ▴ 710

Not really MySQL databases but next to proper SQL access there are quite a few datasources out there that are accessible through REST/SOAP. Many of these sources are listed on http://www.biocatalogue.org/.

ADD COMMENT
0
Entering edit mode
11.1 years ago

(Update 2013-10-08)

The Nencki Genomics Database (NGD) extends the functionality of Ensembl funcgen.

The key enhancements over the Ensembl:

  1. A user can add private data, analyze them alongside the public data, and manage access rights.
  2. Inside the database, we provide efficient algorithms for computing intersections between the regulatory features, and for mapping them to the genes.

mysql -h database.nencki-genomics.org -u public
ADD COMMENT
0
Entering edit mode

If you wish then I can compile a single page document ( Essentially a copy paste) containing information on all these public sql server.

ADD REPLY

Login before adding your answer.

Traffic: 2283 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6