How do I best manage multiple BLAST databases?
I am pretty new to bioinformatics, but I have a computing background. I have a few questions about how best to use blastn to achieve my goal.
I have multiple databases I prepared, creatively named virus.fa, bacteria.fa, fungi.fa, human.fa, mouse.fa, and rat.fa.
I want to be able to BLAST against any combination of the databases, hopefully without doing anything crazy like computing all database permutations.
- I don't think I can BLAST against multiple databases at once, is that correct?
- As I understand it, I can make 6 separate BLAST databases, search them one at a time, and then concatenate the results.
  - Is this computationally wasteful?
- I could make 1 big database and ignore hits for organisms I don't want to search.
  - This is obviously wasteful.
- Can makeblastdb take multiple input files? I don't think it can, so I would have to cat them up before making the db.
- Is there a solution I am missing, hopefully an elegant solution?
Thank you,
Jeremy Cox
CSE PhD student
Based on the information you provided, it looks like I can keep the 6 databases separate and create an alias file to refer to multiple databases.
I think this is the opposite of what you described about making aliases for subsets?
Yes, you can do that too.
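For anyone finding this thread later, here is a minimal sketch of the two approaches discussed: grouping existing databases under an alias with blastdb_aliastool, or passing several database names to blastn in one quoted -db argument. It only builds and prints the command lines; it does not run BLAST. The database names assume makeblastdb was run on each FASTA file without -out, so each database carries the name of its input file. The alias name, query file, and output file are hypothetical placeholders.

```python
import shlex


def alias_command(db_names, alias_name, dbtype="nucl"):
    """Build a blastdb_aliastool command that groups existing databases
    under a single alias name (no sequence data is copied)."""
    return [
        "blastdb_aliastool",
        "-dblist", " ".join(db_names),  # space-separated list, passed as one argument
        "-dbtype", dbtype,
        "-out", alias_name,             # hypothetical alias name
        "-title", alias_name,
    ]


def blastn_command(db_names, query, out):
    """Build a blastn command that searches several databases in one run
    by passing their names as a single space-separated -db value."""
    return ["blastn", "-db", " ".join(db_names), "-query", query, "-out", out]


if __name__ == "__main__":
    # Assumed database names: the FASTA file names from the question.
    print(shlex.join(alias_command(["virus.fa", "bacteria.fa"], "microbes")))
    # query.fa and hits.txt are placeholder file names.
    print(shlex.join(blastn_command(["human.fa", "mouse.fa"], "query.fa", "hits.txt")))
```

Either way, the 6 databases stay separate on disk, and any combination can be selected per search without building a database for every subset.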