database for fungus ?
3
0
Entering edit mode
8.7 years ago
Picasa ▴ 650

In order to find some fungi in my dataset,

Can you recommend a complete fungi database ?

Thanks

fungus databse • 3.5k views
ADD COMMENT
2
Entering edit mode
ADD COMMENT
1
Entering edit mode
ADD COMMENT
0
Entering edit mode
8.7 years ago
Picasa ▴ 650

Thanks but do I need to download each species ? or there is a global database. (Im looking for fasta actually)

ADD COMMENT
1
Entering edit mode

In future use the "Add comment" button when you are providing supplementary information or comments.
You would need to get the fasta genome files from the link @5heikki provided. Similar link for Ensembl is here. It may be possible to recover these from refseq_genome blast database (if you have that handy) and the blastdbcmd tool.

ADD REPLY
1
Entering edit mode

Something like below to download the latest fungal assemblies from GenBank into a dir:

wget -q -O- ftp://ftp.ncbi.nlm.nih.gov/genomes/genbank/fungi/assembly_summary.txt | cut -f 20 -d $'\t' | awk '{FS="/"}{print $0"/"$6"_genomic.fna.gz"}' | xargs -n 1 wget
ADD REPLY
0
Entering edit mode

Thanks for your help, can you explain your command ??

ADD REPLY
1
Entering edit mode
wget -q -O- ftp://ftp.ncbi.nlm.nih.gov/genomes/genbank/fungi/assembly_summary.txt

Download ftp://ftp.ncbi.nlm.nih.gov/genomes/genbank/fungi/assembly_summary.txt and output to STDOUT.

cut -f 20 -d $'\t'

Parse the 20th column (columns separated by tabs) from STDIN and output to STDOUT (20th column of the assembly_summary.txt file has the base ftp urls for the genome assemblies).

awk '{FS="/"}{print $0"/"$6"_genomic.fna.gz"}'

Print the base ftp url and append: slash, the 6th field of the base ftp url, and _genomic.fna.gz. Output to STDOUT (fields separated by slashes, e.g. ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA_001600535.1_JCM_30696_assembly_v001 becomes ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA_001600535.1_JCM_30696_assembly_v001/GCA_001600535.1_JCM_30696_assembly_v001_genomic.fna.gz). As far as I know, this is a valid url construction method to all latest GenBank assemblies.

xargs -n 1 wget

Provide each constructed ftp url to wget one by one from STDIN:

Commands are glued together by pipes, i.e. "|"

ADD REPLY

Login before adding your answer.

Traffic: 2357 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6