I need to pairwise align protein structures.
Specifically, my task requires
- alignment of structure predictions in PDB format, for EACH of ~1000 full length proteins
versus
- X-ray crystallography based structure PDB files for Pfam/hmmsearch delimited domain only regions of EACH of these 6 (shorter domain) sequences.
This means ~1000*6 = 6,000 pairwise structure alignment runs.
Which software do you recommend from Wiki or elsewhere, with my following requirements:
- I should be able to download and run on my local laptop or university HPC, and not on a webserver (unless batch submission and download are allowed).
- The runtime, diskspace and RAM needed for each of my 6,000 pairwise alignment need to be reasonable for either my local laptop or on my univ's HPC (with ~2GB RAM nodes and 12cpu limit on my user account)
- I need to be able to parse the alignment results programmatically (using Perl or Python scripting...)
FINAL GOAL = Classify and/or rank each of the full-length proteins based on likelihood of domainĀ of interest being "present", not using pairwise sequence alignments, but from reported metrics of pairwise structure alignments (rather than my arbitrary yes/no/maybe classification)
If your suggested bioinformatics pipeline has been published, better yet!
Thank you all, in advance. Cheers!
I need to do something similar. What software did you end up choosing?