Entering edit mode
8.6 years ago
Picasa
▴
650
In order to find some fungi in my dataset,
Can you recommend a complete fungi database ?
Thanks
In order to find some fungi in my dataset,
Can you recommend a complete fungi database ?
Thanks
Thanks but do I need to download each species ? or there is a global database. (Im looking for fasta actually)
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
In future use the "Add comment" button when you are providing supplementary information or comments.
You would need to get the fasta genome files from the link @5heikki provided. Similar link for Ensembl is here. It may be possible to recover these from refseq_genome blast database (if you have that handy) and the blastdbcmd tool.
Something like below to download the latest fungal assemblies from GenBank into a dir:
Thanks for your help, can you explain your command ??
Download ftp://ftp.ncbi.nlm.nih.gov/genomes/genbank/fungi/assembly_summary.txt and output to STDOUT.
Parse the 20th column (columns separated by tabs) from STDIN and output to STDOUT (20th column of the assembly_summary.txt file has the base ftp urls for the genome assemblies).
Print the base ftp url and append: slash, the 6th field of the base ftp url, and _genomic.fna.gz. Output to STDOUT (fields separated by slashes, e.g. ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA_001600535.1_JCM_30696_assembly_v001 becomes ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA_001600535.1_JCM_30696_assembly_v001/GCA_001600535.1_JCM_30696_assembly_v001_genomic.fna.gz). As far as I know, this is a valid url construction method to all latest GenBank assemblies.
Provide each constructed ftp url to wget one by one from STDIN:
Commands are glued together by pipes, i.e. "|"