Hello,
I am currently extending my web-service and online resource http://www.opacmo.org/ to support downloads in RDF n-triple format. Most of the data in opacmo already comes from ontologies (GO, DO, ChEBI), but there are also PubMed Central identifiers, Entrez genes and species ids from NCBI's taxonomy present. For the latter three I am not quite sure how I would express their "type" in the RDF output.
For example, if I have an Entrez gene ID, I could give it an RDF type via "is_a" as "Entrez Gene" from the NCI Thesaurus (first search result of http://bioportal.bioontology.org/search?query=entrez+gene&commit=Search). However, if I look for "Pubmed Central" in the BioPortal, then I only get "pmc" from the EDAM ontology as somewhat meaningful result, but EDAM's "pmc" refers actually to the article format and not the identifier.
In a sense I am really looking for an ontology that covers most common biologically/biomedically relevant identifiers. EDAM is already a good start, because it can identify Ensembl gene IDs and PubMed IDs -- even though it does not help me very much with my problem.
Does anyone know if there is a single ontology (like EDAM) that I could use to semantically annotate PubMed Central identifiers, Entrez gene IDs and species IDs from NCBI's taxonomy? Is there an alternative way of giving these identifiers a type in RDF?
Thanks, Joachim
Thank you for pointing out BioPAX. I had a look at their ontology (http://www.biopax.org/release/biopax-level3.owl) and your example and it is indeed a very intuitive approach to the problem.