Hello all,
I'm trying to access GO terms for a list of genes, but they're not easily available. Manually, these are the steps I'd take:
- Search gene ID on NCBI; find protein ID
- Search protein ID on Uniprot/Uniparc; find Pfam ID
- Search Pfam ID on Pfam; switch to InterPro tab, get GO terms
I have seen the e-utilities functionality by Entrez , but I'm having trouble understanding how to use it. When I try a command such as
esearch -db gene -query 'LOC115705987' | elink -target protfam | esearch -db protein -query [PROT]
I get a response like
<ENTREZ_DIRECT>
<Db>protein</Db>
<WebEnv>MCID_61e013e99059043a7d58ece9</WebEnv>
<QueryKey>3</QueryKey>
<Count>1001315687</Count>
<Step>3</Step>
</ENTREZ_DIRECT>
My question is, how can I use this information to get (at first) the protein ID, and subsequently the GO terms?
Thanks in advance!
Thank you! Do you know if uniprot, pfam, and interpro can also be accessed through this? Here I don't see any corresponding databases.