Hi dears!
I have a local database of miRNA::gene interactions for hsa, which contains just "GeneID" (Entrez ID) and "Gene Name" for Genes. I want to extract gene sequences for my genes either with the name or with the ID.
I have two different ideas:
1)I thought about finding a complete downloadable database of human genes, containing id,name and whole sequence of the genes! which I could not find!
or
2)using bioperl (or other similar stuff) and Bio::DB::EntrezGene to extract the gene sequences but it seems that "Bio::DB::EntrezGene" is broken : http://stackoverflow.com/questions/16199037/bioperl-module-biodbentrezgene-no-longer-working
what do you suggest dudes? I am really confused!!!
You can fetch sequences using Entrez ID as the filter in BioMart. Please search this site for the many usage examples.
Please note there are many BioMart installations providing many different databases, in this case you mean the Ensembl BioMart, and most likely the "Ensembl Genes" database available from that BioMart, which contains annotations using Entrez Gene.
yeah bro! It was a semi-solution for me! just a question: which field in the "Ensembl BioMart" is related to the Gene Sequence? (there are lots there!)
When you click "Attributes", you should see "Sequences" as an option right at the top.
Tnx, yes I know, Im now working on BioMart . Do you mean using EMBL Perl API ? or something else?