Hello,
I am a new user of command line blast. I am using a protein sequence query to search through a DNA database I created with makeblastdb. I am getting appropriate results and all is well but I cannot seem to be able to get a fasta file with the complete sequences of the results.
To make clear, I am wanting one file with information such as the e-value and score in tab format (which I am able to get) and in addition, a fasta file with the complete sequences of the resulted accession numbers. If possible I'd want the translated sequence, in amino acids and not DNA.
my command looks like this: tblastn -query query.fasta -db blastdatabase -outfmt 6 -num_threads 3 -max_target_seqs 2000 -out tblastn_DB.tab
I know it's a simple question but I have not been able to solve it looking in the NCBI BLAST command line cookbook.
Thank you very much!
I like @cschu1981 answer. Translating will be a little more difficult, unless they are an ORF, since you won't know which frame to translate in. However, you can look into EMBOSS transeq for translating your sequences. Did you get your DNA db from a public domain? Perhaps there is already a protein file you can cross-reference your db ids.