Hello,
I would like to download all genes of, e.g., Arabidopsis thaliana from, e.g., the circadian rhythm pathway in FASTA format. Is there a way to download them in batch mode, instead of opening and saving every single gene as FASTA?
Each KEGG pathway's KGML file should contain all the associated Entrez IDs. This way you wouldn't need to keep the sequences themselves, but they are easily accessible by ID if you should require them.
There are many ways to do this. One way is to use the keggGet function in the KEGGREST package, available on Bioconductor. There is a limit on the number of IDs you can query at one time (10 per call), so you will need a simple for loop to iterate over the vector of your circadian rhythm NCBI Entrez IDs in batches. Alternatively, I haven't used it in a very long time, but try the plant BioMart database online or the biomaRt R package, for which Arabidopsis is available.
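If you would rather not use R, the same batching idea can be sketched in Python against the KEGG REST API directly. This is only a rough sketch, not a tested pipeline: it assumes the `link` operation returns tab-separated pathway/gene pairs, that `get` with the `ntseq` option returns nucleotide FASTA for up to 10 entries joined with `+`, and that `ath04712` is the Arabidopsis circadian rhythm pathway ID (please check it on the KEGG site). The function names and the output file name are made up for illustration.

```python
import urllib.request

KEGG = "https://rest.kegg.jp"
BATCH = 10  # assumed per-request limit of the KEGG "get" operation


def chunks(items, size=BATCH):
    """Yield consecutive slices of at most `size` items."""
    for i in range(0, len(items), size):
        yield items[i:i + size]


def parse_link(text):
    """Return the gene IDs (second column) from a KEGG 'link' response."""
    genes = []
    for line in text.splitlines():
        parts = line.split("\t")
        if len(parts) == 2:
            genes.append(parts[1].strip())
    return genes


def get_url(ids, option="ntseq"):
    """Build a 'get' URL for a batch of IDs; use option='aaseq' for protein."""
    return f"{KEGG}/get/{'+'.join(ids)}/{option}"


def fetch(url):
    """Download a URL and return its body as text."""
    with urllib.request.urlopen(url) as resp:
        return resp.read().decode()


def download_pathway_fasta(pathway="ath04712", out="pathway_genes.fasta"):
    """Write FASTA for every gene linked to `pathway`; return the gene count."""
    organism = pathway.rstrip("0123456789")  # "ath04712" -> "ath"
    genes = parse_link(fetch(f"{KEGG}/link/{organism}/{pathway}"))
    with open(out, "w") as fh:
        for batch in chunks(genes):
            fh.write(fetch(get_url(batch)))
    return len(genes)
```

Calling `download_pathway_fasta("ath04712")` should then write all sequences into a single FASTA file. If the pathway is large, it is probably polite to add a short pause between requests.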
You can use the download section of WikiPathways to download all the pathways for your preferred organism: https://www.wikipathways.org/index.php/Download_Pathways