What Does 'Intactid' Stand For In Uniprot Xml Files (Comment Binary Interactions Section) ?
5
3
Entering edit mode
13.9 years ago
Pablo Pareja ★ 1.6k

Hi all, I was wondering what exactly intactId stands for in uniprot entries. For example in the entry with accession Q9V3W6 you can find:

<comment type="interaction"> 
<interactant intactId="EBI-126914" /> 
<interactant intactId="EBI-213208"> 
<id>P13607-1</id> 
<label>Atpalpha</label> 
</interactant> 
<organismsDiffer>false</organismsDiffer> 
<experiments>1</experiments> 
</comment>

I've been checking the ids in intact db website and sometimes I get the same results as entering the accession of one of the proteins, but not always. I thought as well that they could be specific ids for interactions between two proteins, i.e. a unique id for each pair. I would appreciate any contribution. Regards,

Pablo Pareja

uniprot xml interaction • 4.0k views
ADD COMMENT
3
Entering edit mode
13.9 years ago

Hi guys,

I will try to explain the reason why some of our proteins do have multiple ACs.

In the case above, molecule B is a splice variant:

$2 ID(s) interactor B : uniprotkb:P15806-2|intact:EBI-413585

The way we are dealing with splice variant (and feature chains for that matter) is that we are creating a dedicated protein in the database and linking it to it's parent (i.e. the uniprot entry where the splice variant is described) by creating a cross reference to the AC of that protein in the database. So what you are seeing line 21 (below) is that specific cross reference and you can also see in parenthesis the protein ID:

$21 Properties interactor B : intact:EBI-81370(tfe2_mouse)|uniprotkb:P15806-2(identity)

I hope that clarifies this one.

Cheers
Sam

ADD COMMENT
2
Entering edit mode
13.9 years ago

Hi Pablo, welcome on biostar,

I guess the Id in InAct ( http://www.ebi.ac.uk/intact/search/do/search?binary=EBI-126914,EBI-213208) is just an internal Id for a protein. So, EBI-126914 is just the IntAct-Id for 'prp7'

Update:after I saw Pablo's I grep'ed the IntAct data; there are only eight interactions with EBI-81370:

curl -s "ftp://ftp.ebi.ac.uk/pub/databases/intact/current/psimitab/intact.txt"  | cut -d '     ' -f 1,2 | egrep '(#|EBI\-81370)'
#ID(s) interactor A ID(s) interactor B
uniprotkb:Q64279|intact:EBI-81361   uniprotkb:P15806|intact:EBI-81370
uniprotkb:P15806|intact:EBI-81370   uniprotkb:Q61039|intact:EBI-81388
uniprotkb:Q64279|intact:EBI-81361   uniprotkb:P15806|intact:EBI-81370
uniprotkb:Q61039|intact:EBI-81388   uniprotkb:P15806|intact:EBI-81370
uniprotkb:P15806|intact:EBI-81370   uniprotkb:P41136|intact:EBI-309167
uniprotkb:P15806|intact:EBI-81370   uniprotkb:P20067|intact:EBI-309519
uniprotkb:P15806|intact:EBI-81370   uniprotkb:O35437|intact:EBI-309880
uniprotkb:P15806|intact:EBI-81370   uniprotkb:Q62392|intact:EBI-309727

but if you're searching for in all the available fields, you'll get 14 records:

 curl -s  "ftp://ftp.ebi.ac.uk/pub/databases/intact/current/psimitab/intact.txt"  | egrep -w  '(#ID|EBI\-81370)' | cut -d '      ' -f 1,2 | cat -n
     1  #ID(s) interactor A ID(s) interactor B
     2  uniprotkb:Q64279|intact:EBI-81361   uniprotkb:P15806|intact:EBI-81370
     3  uniprotkb:P15806|intact:EBI-81370   uniprotkb:Q61039|intact:EBI-81388
     4  uniprotkb:Q64279|intact:EBI-81361   uniprotkb:P15806|intact:EBI-81370
     5  uniprotkb:Q61039|intact:EBI-81388   uniprotkb:P15806|intact:EBI-81370
     6  uniprotkb:P15806|intact:EBI-81370   uniprotkb:P41136|intact:EBI-309167
     7  uniprotkb:P15806|intact:EBI-81370   uniprotkb:P20067|intact:EBI-309519
     8  uniprotkb:P15806|intact:EBI-81370   uniprotkb:O35437|intact:EBI-309880
     9  uniprotkb:P15806|intact:EBI-81370   uniprotkb:Q62392|intact:EBI-309727
    10  uniprotkb:P15806-2|intact:EBI-413585    uniprotkb:P15923-2|intact:EBI-769650
    11  uniprotkb:P15806-2|intact:EBI-413585    uniprotkb:Q9JKN5|intact:EBI-1213712
    12  uniprotkb:P15806-2|intact:EBI-413585    uniprotkb:Q9EQW6|intact:EBI-1213740
    13  uniprotkb:P15806-1|intact:EBI-413581    uniprotkb:Q9JKN5|intact:EBI-1213712
    14  uniprotkb:P15806-1|intact:EBI-413581    uniprotkb:Q9EQW6|intact:EBI-1213740
    15  uniprotkb:P63280|intact:EBI-80180   uniprotkb:P15806-2|intact:EBI-413585

In the case of the interaction: uniprotkb:P63280/ uniprotkb:P15806 ,EBI-81370 appears in the field $21: Properties interactor B. But here I don't know what this property means.... :-/

ADD COMMENT
0
Entering edit mode

Hi Pierre,

That's what I thought first but then I came across some examples like this one in protein Q64279:

[?] [?] [?] [?]P15806[?] [?]Tcf3[?] [?] [?]false[?] [?]1[?] [?]

If you search for the interactant 'P15806' in intact db you get 8 interactions; however if you use its intactId 'EBI-81370' you get 14 results. It looks like they use the same intactId for both protein and its isoforms at least in this case...

ADD REPLY
1
Entering edit mode
13.9 years ago
Raquel Tobes ▴ 160

I think that all the information included in the section comments of a Uniprot entry are manual annotations. These curator annotations probably are a selection of the complete set of interactions collected for a specific entry (in this case for the entry Q9V3W6) in the database IntAct. There is a cross-reference to all these interactions in the section of cross-references entitled "Protein-protein interaction databases" (it is a link to the 8 interactions available at the IntAct database) in the Q9V3W6 Uniprot entry.

The manual annotations have not to be equal to the complete set of interactions available at the IntAct database.

In the comments content:

<interactant intactId="EBI-126914" />
<interactant intactId="EBI-213208">
  <id>P13607-1</id>
  <label>Atpalpha</label>
</interactant>

EBI-126914 is the ID of the entry Q9V3W6

EBI-213208 is the ID of the isoform P13607-1 (Sodium/potassium-transporting ATPase subunit alpha)

You can access to the data of the interaction between the two interactant proteins introducing at the Search box at the URL of IntAct:EBI-126914 AND EBI-213208

ADD COMMENT
1
Entering edit mode
13.9 years ago
Pablo Pareja ★ 1.6k

First of all thanks for your answers.

Summing up it seems that intact-ids like 'EBI-213208' are unique identifiers for both proteins and isoforms. The main reason why I was confused was that when I performed some standard searches in the IntAct DB website the number of results I obtained differed depending on whether I used the intact-id or its supposably equivalent uniprot accession. However I've just checked what happens if I search for results specifying the advanced search field Participant Id and I get the same number of results using either of them.

So I guess that if you just use the standard search instead, you'll probably get as results every entry where the string provided is found, regardless of its position/meaning/etc.

We can see how in the example provided by Pierre there is an occurrence of the string provided in a property line #21. which by the way I do not know either what it means... :-|

ADD COMMENT
0
Entering edit mode
13.9 years ago
Jerven ▴ 660

The intactid identifies a participant protein. While this may map to a single uniprot protein, in one case it might be expressed in a yeast hybrid experiment or in an experimental setting which can affect real biology. Also the mappings to UniProt sequences are best effort and not guaranteed to be correct for all experiments. For these reasons every interactor in intact has its own id.

Of course you should ask help@uniprot.org to find out exactly what it means.

ADD COMMENT
0
Entering edit mode

Thanks for your answer. I already ask at help@uniprot.org but didn't get any email back yet. I'll post whatever information they provide me as soon as I get it.

ADD REPLY

Login before adding your answer.

Traffic: 1772 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6