Hi,
I want to get the gene names for 2,400,000 SNPs using biomaRt in R. But because the number of SNPs is so large, I could not get the annotation for all of them in a single query. Do you have any other suggestions?
Thank you in advance!
As Wouter says, BioMart is not really a suitable tool for this, as it cannot handle that much data. The Variant Effect Predictor (VEP) can, however, take millions of variants and report which genes they hit and what effects they have. It will also let you filter your list by consequence, frequency, gene name, etc., so that you can prioritise variants for further study. For your number of variants I would recommend using the standalone script with a cache, all of which is documented.
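As a rough sketch of what a standalone run with a cache could look like: the wrapper below assumes VEP is already installed, that a cache matching your species and assembly has been downloaded (VEP defaults to human), and that your SNPs are in VCF format. The function name `annotate_snps`, the file names, and the fork count are made up for illustration, so please check the flags against the VEP documentation for your version.

```shell
# Sketch only: assumes Ensembl VEP is on the PATH and a matching cache is installed.
# annotate_snps is a hypothetical helper name; $1 = input VCF, $2 = output file.
annotate_snps() {
    # --cache --offline : read annotation from the local cache, no database connection
    # --symbol          : add the gene symbol to each output line
    # --tab             : write tab-delimited output
    # --fork 4          : use 4 parallel processes (arbitrary choice)
    # --force_overwrite : replace the output file if it already exists
    vep --input_file "$1" \
        --output_file "$2" \
        --cache --offline \
        --symbol \
        --tab \
        --fork 4 \
        --force_overwrite
}

# Example call (hypothetical file names):
# annotate_snps my_snps.vcf my_snps_annotated.txt
```

With tab-delimited output, the gene symbols end up in the SYMBOL column, which you can then read back into R if you need to continue there.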
Does it have to be biomaRt in R? VEP, for example, would be the most commonly used tool from Ensembl for SNP annotation.
Also, it's not clear which format your data is in; please add an example to your post.
Possible duplicate, related posts:
Snp Annotation Tool