Convert beebase ids
0
0
Entering edit mode
6.4 years ago
The Last Word ▴ 230

I have a bunch of beebase ids. Which converter can i use to convert them to Uniprot ids? Beebase as it is, is a poorly maintained database. Example of the ids are

GB12498

GB16677

GB16154

GB18362

GB30528

GB16592

GB12455

GB30150

GB17048

GB17763

beebase • 1.7k views
ADD COMMENT
0
Entering edit mode

Provide examples of the said ID's.

ADD REPLY
0
Entering edit mode

@genomax please see the edit

ADD REPLY
0
Entering edit mode

Looks like you can use the HymenopteraMine to recover the info you need. Tutorial here.

ADD REPLY
0
Entering edit mode

I tried this before. Sadly, HymenopterMine doesn't work and is not well maintained.

ADD REPLY
0
Entering edit mode

For this query you may have to do the mapping yourself by using original sequences, if the BeeBase is not actively maintained and you have not found alternate means to get this information.

ADD REPLY
0
Entering edit mode

Hi genomax, I think I found the problem. These ids are from official gene set v1.0 for apis mellifera. The latest geneset is OGS3.2. Hymenopteramine is tuned to retrieve ids from only this new dataset. I found a paper that mapped these gene ids to the new geneset and they say they used BLAST but I can't seem to understand how they did that. This is the paper https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4275039/

ADD REPLY
0
Entering edit mode

I see that the BEEBASE ID's in latest GTF file contain none of the ID's you have posted above.

I see a number of arrays listed in EBI Array Express. You can look through these to see which of them matches the ID's you have, which should give you sequence of the oligos for those ID's. I think you will need to take those sequences and use blast against the current genome build to identify current BeeBase ID's.

EDIT: I found the excel file that contains the oligo sequences for the ID's you have, along with the ADF file.

ADD REPLY
0
Entering edit mode

I wanted to ask a question. Here is another oligonucleotide array https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GPL16023 with sequences for the beebase IDs. The sequences in the set that you gave earlier and the ones in this are different. Any idea why that would be the case. I was actually using the former to retrieve the sequences of the probes. Might be a stupid question but my knowledge with microarray analysis is limited.

ADD REPLY
0
Entering edit mode

As you saw from the EBI Array Express link there were multiple versions/types of Apis arrays. This is a good illustration of how messy things become without proper metadata. The link for the excel file was from 2010 where as the platform link you included is from 2012 that must be a newer iteration.

Do you know which array/data your ID's originate from? That would really be the only way to nail this down for sure.

ADD REPLY
0
Entering edit mode

I am actually working on this network https://hood-price.systemsbiology.org/research/honeybee-transcriptional-regulatory-network/ . The paper linked to it is https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4755009/ . I did reread the paper for this information but other than stating that the brain transcriptome data set was generated in Dr. Robinson’s lab. I couldn't find any information about the platform used.

ADD REPLY
0
Entering edit mode

Found it Genomax. Its https://www.ebi.ac.uk/arrayexpress/arrays/A-MEXP-755/?ref=E-TABM-953 . In that paper, in the methods section, he has mentioned some other studies done on the same array. Thank you for your help and guidance.

ADD REPLY

Login before adding your answer.

Traffic: 1938 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6