Automated Submission Of Jobs To Ensembl Biomart
1
1
Entering edit mode
12.3 years ago
Rubal7 ▴ 850

Hello Everyone,

I am generating multiple large lists of genomic regions. To find out what genes are in these regions usually I would submit the regions to ensemble biomart. However now I generate too many lists for it to be feasible to do this manually. Does anyone know if it is possible to (simply - I only have basic programming skills) automate this process with a script or if there is a similar service that can do this automatically with list of files or some commands.

ps I would also be interested if people know/use a similar automated approach for looking for pathway enrichment, as this is what I will then do with the list of genes.

Thanks very much in advance for your help!

Best regards,

Rubal

ensembl biomart genome • 2.6k views
ADD COMMENT
0
Entering edit mode

Which command you use for it, a wrapper script can be made according to that, if you tell us.

ADD REPLY
0
Entering edit mode

Currently I go to ensembl biomart using the browser, select the ensebml genes database, choose homo sapiens as species, then under filters, select multiple chromosomal regions and link to a file containing these regions. Then this outputs a file containing the gene ids in the region

ADD REPLY
0
Entering edit mode

Try using biomaRt r package, you can pull data into R interface directly and can either export it or process it. You can make automated Rscripts for that. I don't work with it, so maybe someone else, working on it will help you with documentation and scripts. Tutorial

ADD REPLY
0
Entering edit mode

biomart looks perfect for this thank you. It even has some relevant examples in the documentation. Assuming there is a way to submit a list of regions in Biomart rather than one region at a time this will be great (I assume there is a way to do this, I will keep looking)

ADD REPLY
2
Entering edit mode
12.3 years ago
Joachim ★ 2.9k

In order to automate your queries, you would first carry out an example query that suits your needs by hand. That means that you set the dataset you would like to query, the attributes whose values you would like to be returned, and the filters that constrain your result set. After pressing "Results", you can then click on the "Perl" button to get a Perl script that you can modify to your needs. Alternatively, you can click on "XML" to get a BioMart XML query whose values you can programmatically alter.

In a nutshell:

  1. carry out one query manually (set dataset, filters & attributes)
  2. get the programmatic query equivalent by clicking on "XML" or "Perl"
  3. use the XML or Perl to programmatically query BioMart

A more detailed description can be found in this earlier answer that I gave.

ADD COMMENT

Login before adding your answer.

Traffic: 2017 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6