Where Can I Find Additional Gene Sets To Extend Msigdb For Gsea?
2
1
Entering edit mode
10.7 years ago
enricoferrero ▴ 910

Hi everyone,

I am looking for additional gene sets that can be used to expand the gene sets found in the Molecular Signatures Database (MSigDB). In particular, I am looking for signatures such as the c6 (oncogenic signatures) and c7 (immunologic signatures) gene sets, i.e.: lists of genes up and downregulated across different conditions from a diverse panel of gene expression experiments (micorarray, RNA-seq).

The idea is to then run GSEA using this expanded collection of gene sets and hopefully get some biological insights about the datasets I'm analysing. Ideally, these gene sets would already be in GMT format, but I'm obviously flexible about this.

Hasn't anybody attempted to analyse all of GEO using some fixed thresholds and made a database yet? :)

Thanks very much!

• 4.6k views
ADD COMMENT
2
Entering edit mode
10.7 years ago

I actually developed an algorithm called BD-Func to compare up-regulated versus down-regulated gene expression. The package includes MSigDB listings for MSigDB (although I should add the c7 signatures) and GO (positive regulation of X, negative regulation of X).

You can download the GO gene lists along with the main package:

http://sourceforge.net/projects/bdfunc/

There are also 3 additional signatures specifically defined by BD-Func:

http://sourceforge.net/p/bdfunc/discussion/signatures/

I'll have to check out those links that you provided, but I didn't see a direct link to the positive/negative regulation GO gene lists.

ADD COMMENT
0
Entering edit mode

What do you mean by "positive/negative regulation gene lists"; do you mean gene sets where we have information about up-regulation or down-regulation of their genes?

ADD REPLY
1
Entering edit mode

Up- and Down-regulation here refers to the pathway (as indicated from gene expression changes in other experiments).

Let's say the pathway is Notch signaling. "UP" would be the downstream targets that are activated and "DOWN" would be the downstream targets that are repressed. Or, if the gene list is associated with a drug treatment, it would be the genes with increased expression due to drug treatment versus genes with decreased expression due to drug treatment.

ADD REPLY
0
Entering edit mode

Thanks Charles, that would definitely be useful to have information about Up- and Down-regulation of genes within a gene set. The available gene set databases, such as MSigDB, do not provide any information about Up- and Down-regulation. Do you know how can I get such information about gene sets?

ADD REPLY
1
Entering edit mode

It is in the names of the signatures from MSigDB.

For example, when NOTCH is inhibited by DAPT, these are the up-regulated genes:

http://software.broadinstitute.org/gsea/msigdb/cards/NOTCH_DN.V1_UP

and these are the down-regulated genes:

http://software.broadinstitute.org/gsea/msigdb/cards/NOTCH_DN.V1_DN

In this case, the nomenclature is a little confusing

ADD REPLY
1
Entering edit mode
10.7 years ago
enricoferrero ▴ 910

Here's what I've found so far:

GeneSetDB (PubMed)

GeneSigDB (PubMed)

PAGED (PubMed)

Enriched Maps Gene Sets (PubMed)

ADD COMMENT
0
Entering edit mode

Would you please et me know if I can get PAGED database (I mean the whole database)

ADD REPLY

Login before adding your answer.

Traffic: 2234 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6