I intend to use a similarity based binning program like MEGAN or SOrt-ITEMS or CARMA for analyzing sequences in my metagenomic data set. For this, I first have to generate a BLASTx output of my metagenomic sequences against a huge data base such as nr or pfam. I do not have huge computing resources to run a standalone for the same. Any suggestions as to how do I obtain a blastx output for a million sequences.
I did much the same recently for several million metagenome sequences. Fortunately the researcher was patient, so we just waited until it finished (spread across as many CPUs as we could spare). You need to prioritise time taken vs the cost. If this is a one off computational step, and you have no resources locally, EC2 as Brad suggests, is the way to go.