8.5 years ago
biotech
I'm trying to add functional information to a fungus genome annotated with AUGUSTUS. Here is my command. It is taking forever, even though there is just one protein inside 'example_1.faa'.
blastp -db /home/bernardo/Databases/BLAST/nr -query example_1.faa -out AWNI01.tab -evalue 0.000001 -outfmt "6 qseqid stitle pident length mismatch gapopen qstart qend sstart send evalue bitscore qcovs qcovhsp" -max_target_seqs 1 -num_threads 16 # note outfmt and max_target_seqs
I thought about switching to another database, maybe swissprot or refseq_protein, but then I might miss annotations.
Thanks
If this is a standalone machine, you may have done all you can. Have you checked whether all of your CPU cores are 100% busy (top/htop etc.)? If they are not, that could indicate your system is I/O bound, in which case you can't do much except be patient.
Do you have any benchmarking data for BLASTp? I have 6K proteins in my input file. Will I have to wait maybe two or three days?
You said you have only one protein in the file. So this is just a test?
If you have 6K proteins to do, then you would want to run this somewhere else, ideally on a cluster, with your input file split into chunks of 100 sequences that are searched in parallel.
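The chunking idea can be sketched roughly as follows. This is a minimal example, assuming a well-formed FASTA file in which every record starts with a `>` header line; the function name `split_fasta`, the file name `proteins.faa`, and the `chunk_NNNN.faa` naming scheme are illustrative and not from the thread.

```shell
# Split a protein FASTA file into chunks of at most N sequences each.
# Usage: split_fasta INPUT.faa N PREFIX  ->  PREFIX_0001.faa, PREFIX_0002.faa, ...
# (split_fasta and the output naming scheme are hypothetical helpers.)
split_fasta() {
  awk -v size="$2" -v prefix="$3" '
    /^>/ {
      # Open a new chunk file every "size" header lines.
      if (n % size == 0) {
        if (out != "") close(out)
        out = sprintf("%s_%04d.faa", prefix, ++chunk)
      }
      n++
    }
    # Write each line to the current chunk (lines before the first header are skipped).
    out != "" { print > out }
  ' "$1"
}

# Example with hypothetical file names: 6K proteins give about 60 chunks of 100.
# split_fasta proteins.faa 100 chunk
# for f in chunk_*.faa; do
#   blastp -db /home/bernardo/Databases/BLAST/nr -query "$f" -out "${f%.faa}.tab" \
#     -evalue 0.000001 -outfmt 6 -max_target_seqs 1 -num_threads 16
# done
```

The commented loop runs the chunks one after another; on a cluster, each chunk would instead be submitted as its own job so that the searches run in parallel.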
Yes, it's just a test. Thanks for the chunks idea.
I'm trying to annotate a fungus. Maybe extracting only the fungal sequences from the nr database would be an alternative.
Did you download the complete nr database from the ftp site (most recent has 49 parts)? Did you do makeblastdb on all parts of the nr db (you should have gotten an error if you didn't)? This post could also help: Blast Help On Nucleotide Collection Nr/Nt
It's running and I get correct output. I downloaded the preformatted BLAST database for nr.
I did this:
nohup wget ftp://anonymous@ftp.ncbi.nih.gov/blast/db/nr.* > foo_wget3.out 2> foo_wget3.err < /dev/null &
I will go with a custom database for now. I will use all proteins from sequenced species of the same fungal genus. It seems there are some full genomes available. Just 26K, what a tiny thing!!
Great. It took just 20 min, the exact time for a coffee.
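For anyone wanting to try the custom-database route, here is a rough sketch. The thread does not say how the database was actually built, so this assumes the genus proteins were collected into one FASTA file (hypothetically `genus_proteins.faa`) and formatted with `makeblastdb`. The helper names `run` and `search_custom_db` and the `DRY_RUN` switch are made up for illustration; the blastp options are the ones from the original command.

```shell
# run: execute a command, or only print it when DRY_RUN is set (hypothetical helper).
# The printed form drops shell quoting, so it is for inspection only.
run() {
  if [ -n "${DRY_RUN:-}" ]; then echo "$*"; else "$@"; fi
}

# search_custom_db GENUS_PROTEINS.faa QUERY.faa OUT.tab
# Builds a protein BLAST database from the genus FASTA file, then searches it.
search_custom_db() {
  genus_faa=$1
  query=$2
  out=$3
  db="${genus_faa%.*}_db"   # e.g. genus_proteins.faa -> genus_proteins_db

  run makeblastdb -in "$genus_faa" -dbtype prot -out "$db"
  run blastp -db "$db" -query "$query" -out "$out" -evalue 0.000001 \
    -outfmt "6 qseqid stitle pident length mismatch gapopen qstart qend sstart send evalue bitscore qcovs qcovhsp" \
    -max_target_seqs 1 -num_threads 16
}

# Example (hypothetical file names); prefix with DRY_RUN=1 to only print the commands.
# search_custom_db genus_proteins.faa example_1.faa AWNI01.tab
```

A small database like this searches far faster than nr, at the cost of only finding hits among the proteins that were put into it.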