Hi,
I've downloaded the gene_association.goa_human file from the EBI website. I tried running a Python script to filter for GO IDs with "membrane" in the description. This approach doesn't seem to be working because the target receptor I am looking for doesn't contain "membrane" in the description (though it is a membrane receptor). However, the QuickGO search feature on the EBI website, given the GO ID corresponding to the target receptor, shows the correct description in the name field (name: membrane). Is there any way I can get a list of human GO IDs that contain the appropriate descriptions (the gene_association.goa_human file doesn't work)? My aim is to generate a list of GO IDs and gene names that correspond to membrane-expressed proteins.
Thanks
Appreciate any help
Siddharth
Hi
Thanks a ton, this is exactly what I was looking for. I am having problems with downloading, however. I am unable to download more than 5000 of the results at a time (the prompt at the website says 10,000 lines). And how do I go about downloading the next batch of results (5000 - 10000) and so on?
Could you pass me a link to the results you're trying to download? I'm asking because I just downloaded the file containing all gene products related to membrane. It's way more than 10,000 lines and I didn't encounter any prompt saying there's a limit.
http://amigo.geneontology.org/amigo/term/GO:0016020#display-lineage-tab
That's the results page, and upon clicking on the download link, I get the prompt
You're right - there is a limit of 10,000 lines. I don't see a simple way to overcome this limitation. However, you can download the results in parts of 10,000 lines. Click the download button and display the first 10,000 lines. Look at the URL in your browser's address bar. For example, in my browser it looks like this:
Notice that there is a start variable in the URL. It specifies the number of the first row to display. For example, if you want to see the next 10,000 lines, just change the value of the start variable from 0 to 10000.
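If you'd rather not edit the URL by hand for every batch, the same idea can be scripted. This is only a rough sketch: it assumes the download URL carries a start query parameter as described above, and the function names with_start and batch_urls are made up for illustration. You would pass in the URL copied from your own browser and the total number of rows shown on the results page.

```python
from urllib.parse import urlparse, parse_qs, urlencode, urlunparse


def with_start(url, start):
    """Return a copy of url with its 'start' query parameter set to start."""
    parts = urlparse(url)
    # keep_blank_values so that empty parameters in the original URL survive
    query = parse_qs(parts.query, keep_blank_values=True)
    query["start"] = [str(start)]
    # doseq=True writes repeated parameters back out one by one
    return urlunparse(parts._replace(query=urlencode(query, doseq=True)))


def batch_urls(url, total_rows, batch_size=10000):
    """Build one URL per batch: start=0, start=batch_size, and so on."""
    return [with_start(url, offset) for offset in range(0, total_rows, batch_size)]


# Usage (download_url is whatever your browser shows after clicking download;
# total_rows is the hit count reported on the results page):
# for u in batch_urls(download_url, total_rows):
#     print(u)
```

Note that urlencode re-escapes the parameter values, so the rebuilt URL may look slightly different from the one in the browser (for example, ":" becomes "%3A") while meaning the same thing.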
Alternatively, you may download all gene products related to membrane using QuickGO from EBI. The philosophy is the same: enter the GO accession number to see all proteins that are related to it.
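The same accession-based approach should also work on the local gene_association.goa_human file, since that file lists GO IDs rather than term names. Below is a minimal sketch, assuming the standard tab-separated GAF layout (column 3 is the gene symbol, column 4 the qualifier, column 5 the GO ID, and comment lines start with "!"). The function name membrane_annotations is hypothetical, and the set of GO IDs is something you supply yourself: GO:0016020 alone will miss proteins annotated only to more specific child terms, so you would need to collect those from the ontology separately.

```python
def membrane_annotations(lines, go_ids):
    """Collect (GO ID, gene symbol) pairs for annotations to any ID in go_ids.

    lines  -- an iterable of GAF lines (e.g. an open file handle)
    go_ids -- a set of GO accessions, e.g. {"GO:0016020"}
    """
    pairs = set()
    for line in lines:
        if line.startswith("!"):
            continue  # header/comment line
        cols = line.rstrip("\n").split("\t")
        if len(cols) < 9:
            continue  # skip malformed or truncated rows
        symbol, qualifier, go_id = cols[2], cols[3], cols[4]
        if "NOT" in qualifier.split("|"):
            continue  # negated annotation: protein is NOT at this term
        if go_id in go_ids:
            pairs.add((go_id, symbol))
    return sorted(pairs)


# Usage (assumes the uncompressed file is in the working directory):
# with open("gene_association.goa_human") as handle:
#     for go_id, symbol in membrane_annotations(handle, {"GO:0016020"}):
#         print(go_id, symbol, sep="\t")
```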
Thanks a ton. I was agonising over that next step. :)