Hi, I am trying to perform GO term enrichment analysis using GO::TermFinder Perl module. I have already gone thro' most of the posts that I found relevant to me here on biostar. Especially this one. Amidst all other softwares, I'd like to be able to use GO::TermFinder as I find it a bit more powerful tool and I am quite comfortable with Perl.
Now, I require two gene files (1) candidate genes and 2) background genes) and in addition, the gene ontology term definitions and finally a gene-association file: here's an example. I am working on tomatoes (Solanum lycopersicum). And the genome is about to be published and as far as I have searched, there doesn't exist a gene-association file (for Arabidopsis thaliana it seems to exist, of course). Its seems to be basically a file with geneID and corresponding GO terms (+ some more info). Now, I have the GO terms for 22000 (out of 37000) genes from the current gene model annotation. My question is, is there any software or an alternative easier approach (I don't mind coding) to gather info required to construct this file. I just have the geneID, GO terms and functional annotations.
This seems to be the only missing link for me to use GO::TermFinder (and for that matter, any other softwares, I'd suppose?? ).
Hi Arun, I have a problem similar to yours (but I hope you have solved it). I'm working on tomato genes and I need a gene association file to use Ontologizer. I have downloaded the same file you used to obtain GO terms (ITAG2.3genemodels), but I have been able to isolate only about 19000 GO terms, so the first question I ask you is the method you used for extracting associations between GO terms and gene names. It would be very useful for me to retrieve about 3000 more genes! Then I ask you if you have found a method to obtain a gene association file.
Thank you in advance for your help, Raffaella
Hi Raffaella, I have same questions you asked here and apparently, they are old questions. I tried to get the gene association file provided by the Solanaceae genome group but it still contains GO terms for only about 500 genes. I am wondering if you've succeeded to generate a gene association file for tomato, and I also would like to know the method you used to obtain GO terms.
THANK YOU VERY MUCH! Xin
I switched to MapMan software+annotations.