Hi All,
I am trying to make a 'HOW TO' on gene ontology analysis. If you know answer of these questions please help. I hope it will be very useful to all people who are new to gene ontology analysis and bioinformatic (like me :) Please post answers in one specific tool/language (preferable in R,python,perl)
Give a GO ID (e.g. GO:0090342) How to:
- find all its children up-to a specific depth. For example find up-to 4th level
- find all its parents and grand parents up-to a specific height. For example up-to 4th level
- the name (definition) of the GO ID
- draw the tree (with name (definition) & GO ID)
- find all the directly associated gene/protein with specific GO ID
- find all gene/protein associated with specific GO ID and its children (up-to level n)
Given a gene/protein ID/name (e.g. UniProtKB ID) How to:
- find all the associated GO IDs with specific type (e.g. all GO IDs associated with UniProtKB ID which are related to 'biological process' )
- remove all over-represented GO IDs (from the result of last query). (e.g. GO:0090342 and GO:0050793 both are associated with p53 but as GO:0050793 is child of GO:0090342 so I want to remove GO:0090342 from the data and want to keep only GO:0050793)
- find whether a specific GO ID is associated with a given UniProtKB ID or not?
(Some questions may be repeat/extension of previous questions, still its good to have a direct answer)