How should a query a gene type in multiple genomes?
2
0
Entering edit mode
6.6 years ago

I want to query all genes of a specific type (eg.: microRNAs) in genomes of multiple species. Initially I was doing this using Ensemble's BioMart, but it limits me to query a single genome at a time and I the web interface makes it impossible to automate the process. Is there a better way of doing this task?

Thank you.

query search • 1.5k views
ADD COMMENT
1
Entering edit mode
6.6 years ago
JJ ▴ 710

With query you mean what exactly? What do you want to get?

However, try the R package BiomaRt - this should allow you to automate the process. Alternatively download the gff / gtf files of all the species from Ensembl and do it in bash by selecting the biotype.

ADD COMMENT
0
Entering edit mode

The problem with BiomaRt is that I can only query one database (in this case, one genome) at a time. I suppose I could create a loop to automatequerying though multiple databases.

ADD REPLY
1
Entering edit mode

Yes, that's true but I cannot think of an approach where you can select multiple species at the same time. I personally would prefer using bash (e.g., grep) and selecting certain biotypes from all the gtf/gff files - this could be neatly automated.

ADD REPLY
0
Entering edit mode
6.6 years ago
GenoMax 147k

Use RNA central for things related to RNA. Example query for microRNA. Limit by selecting species in left column. There is an API available.

For other types of data NCBI's eUtils or Ensembl API would be the way to go.

ADD COMMENT
0
Entering edit mode

I didn't know about RNA Central. I'll look into it right now. Thank you!

As far as I know, it's not possible to query a specific gene type in Entrez, but I'll investigate it.

ADD REPLY
0
Entering edit mode

Can you post an example of gene type you would look for?

ADD REPLY

Login before adding your answer.

Traffic: 1629 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6