Entering edit mode
5.3 years ago
baurumon
▴
30
This might be a stupid question but I am new in bioinformatics.
I have few FASTA files which I want to run reciprocal blast. Can anyone please guide me through makeblastdb command line process.
- I have downloaded NCBI blast database and extracted nr FASTA file through blastdbcmd
- Should I run makeblastdb command with my FASTA file or nr FASTA to create database?
- what will be the next step after creating database?
If you follow the reciprocal blast algorithm, e.g. as described at https://www.flyrnai.org/RNAi_orthology.html,then:
1.) Extract a set of genes of interest from an existing database. Generate a blast database for this set. 2.) Generate a blast database for your query sequences.
3.) For each gene in the set generated in 1.): a.) Blast it against your query sequences b.) Take the sequence of the best hit and blast it against the original set of genes of interest. If this is the same as the gene of interest, you have a potential ortholog.
This means, you need to create blast databases for both your genes of interest as well as your query set. Since you do two blast calls for each gene of interest, it does not really make sense to use the whole of NCBI nr as genes of interest. It would be more appropriate to take a set from a related organism or a specific set of genes (e.g. plant R-genes etc.)
Thank you for the explanation !!!!
i used this command to blast in database blastn -query work.fa -db ~/Blast/ssalar/ssalar -outfmt '6 qseqid sseqid sseq' -out newblast_out -word_size 11 -evalue 1e-5 -num_alignments 1 -num_threads 10
and got output as bellow. As you can see i have several hit for one sequence. how can i filter best hit from this file.
this issue is solved. thanks
Reciprocal blast is generally between a small set of sequences, not NCBI NR.