Hi, I am a newbie and I have a very stupid question on my mind. I have ~50000 sequences that I need to BLAST against nr database. Does BLAST 2.2.3 comes with a functionality to pick up the job from where it left off as I am running it on a cluster with limited hours for a job? Or if there's a better way to perform this task, then please also suggest. (I'm trying to keep my sequences together for blast and not break into several small tasks).
Why do you want to keep them in one go?? By separating the job, you can have increased speed (multiple run at the same time) and it can help you to solve your problem
I am trying to get the output in xml format which needs to be used further down the line. And I am not sure if breaking down the sequences now and then later concatenating the xml files will be of help.
I think it should be possible to concatenated xml files to have them look like they were generated in a single run. If this is necessary depends on what you are going to do, I think the BioPerl parser will work with a concatenated xml file, also the Tripal importer works with multiple xml files.