Integrating gene ontology DB's into pipeline scripts
2
0
Entering edit mode
1 day ago
joemrosen • 0

Does anybody have suggestions for integrating gene ontology DBs or BLAST/refseq into bioinformatics pipelines? My goal was to be able to automatically search my output (filtered by impact refseq transcript ID's and/or HUGO gene IDs) against a gene ontology DB such as DAVID or Refseq, then automatically fetch the results to display in a clinical report. I was thinking of maybe building a web scraping script since there doesn't seem to be much support for this sort of workflow API wise.

DAVID Pipelines GO • 214 views
ADD COMMENT
0
Entering edit mode

why scraping as all those databases are available for download ?

ADD REPLY
1
Entering edit mode
15 hours ago
EagleEye 7.6k

Have a look at GeneSCF. It is easy to integrate in any bioinformatics pipelines.

ADD COMMENT
0
Entering edit mode

Looks good, thanks for the tip!

ADD REPLY
0
Entering edit mode
21 hours ago

There should be no need to scrape websites; there are a large number of tools that implement various ontology and functional annotation-related functionalities.

Most are written in R. I would investigate those.

FWIW even DAVID has a web API:

ADD COMMENT
0
Entering edit mode

I tried the DAVID api but the output isnt very detailed, I'm not looking for simple gene annotation, I already have that with snpEff and other tools, I was looking to write a script that takes my list of annotated genes, queries the DB for deep ontological info, and generates an in depth report based on it. some thing like the example below, I know blast has rBlast, but it does it provide deep detail?

High Impact Variant Results: NM_123: High impact, associated with X disease etc.. NM_456: High Impact associated with Y disease, seen in Z populations etc

ADD REPLY

Login before adding your answer.

Traffic: 3365 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6