How to combine a different sources of pathway database (KEGG, BioCarta) for Pathway analysis.
0
4
Entering edit mode
8.4 years ago
kmsh410 ▴ 70

Hi all Biostars, I have a question about how to combine different pathway database resources, since I want to ensure comprehensive coverage of pathways for conducting my pathway analysis. I got a gene-list from VEGAS2 website. Now I want to use this gene list to run pathway analysis. Can someone give me some idea/direction to solve this probelm?

Pathway enrichment analysis GWAS pathway analysis • 5.7k views
ADD COMMENT
1
Entering edit mode

I will not suggest to combine different pathway databases and do enrichment analysis (the statistics will be non-reliable due to its redundancy). Instead you can use the tools that uses most updated information to perform enrichment analysis.

If you are not a Linux user or R user, try web based tools like DAVID (bit outdated information).

But if you are familiar with Linux, use GeneSCF (uses updated information directly from the source in real-time). You can try by using KEGG or REACTOME as source database.

Gene Set Clustering based on Functional annotation (GeneSCF)

ADD REPLY
2
Entering edit mode

The DAVID has been up updated recently with refined features.

ADD REPLY
0
Entering edit mode

Yes I know it is. But still it does not hold CURRENT (realtime) versions of KEGG or geneontology. The point I was telling before is updating the functional annotation to do enrichment analysis (not the tool but the annotation).

https://david.ncifcrf.gov/content.jsp?file=release.html

And still KEGG and BIOCARTA is not a new version as I can see.

https://david.ncifcrf.gov/content.jsp?file=update.html

http://www.genome.jp/kegg/docs/relnote.html

ADD REPLY
0
Entering edit mode

Actually, I want to use some R packages on Bioconductor, such as gage and paxtoolrs packages, but I need to take some time to figure out how to use these packages. I haven't use these tools from Bioconductor before. I also found some experts recommended Enrichr. This one is very convenient, but I still want to learn how to use R or Linux to do analysis.

By the way, thanks you for introducing GeneSCF for me. I will try it.

ADD REPLY
1
Entering edit mode

Good luck with your analysis.

ADD REPLY
1
Entering edit mode

If you get paxtoolsr installed you can grab all the gene sets provided by Pathway Commons by running this command:

geneSets <- downloadPc2("PathwayCommons.8.All.GSEA.hgnc.gmt.gz", version="8")

as a list of vectors. You'll likely want to filter out pathways thas have one or few genes in them using code similar to below:

tmp <- unname(unlist(lapply(geneSets, length)))
idx <- which(tmp > 3)

# Gene sets with more than 3 genes
filteredGeneSets <- geneSets[idx]

Pathway Commons does its best to ensure the best reproduction of the original datasets, which sometimes includes these small pathways.

If you're having problems installing paxtoolsr (especially because of the Java dependency), these videos may help, which provide instructions for OSX, Linux, and Windows:

ADD REPLY

Login before adding your answer.

Traffic: 2455 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6