Hi all,
I would like to do an enrichment analysis/gene ontology analysis after an RNAseq differential expression analysis. For this, I would like to use a set of reference genes. I have already tried GO, but it's giving me some errors (it says that I have duplicate genes, while I've checked and I don't have them). Anyone could suggest a different tool?? Or, anyone that has an idea why I'm getting this error? Thanks a lot in advance!
What is the output of (assuming a plain text file with one gene per row):
1)
sort -k1,1 your.genes | wc -l
2)sort -k1,1 -u your.genes | wc -l
The output of both commands is the same: 17647 genes... Thank you anyway ;)
Which tool did you use and how?
I used webtool Gene Ontology (GO):http://geneontology.org/ I only pasted my list on the webpage and it redirects me to PANTHER, where I can add my reference gene list. At this point it tells me that I have duplicate genes in my reference list...
The error is most likely caused by your input IDs or names being converted to the protein accession numbers used by Panther. This means that several of your inputs correspond/are mapped to the same protein.