Hello, I am new in field of bioinformatics. I have to do similarity search on protein sequences but the number of bacterial strain is large. There are various tools available for genome comparison but I don't know any tool that compare entire proteome of several bacterial strains. BLAST is used for similarity search tool but I don't have an idea of maximum number of bacterial strains that can be used for one Blast Search. Does anyone know of a tool that will help me to identify conserved regions in 5500 bacterial strains.
Thanks
Try this site:
http://scratch.proteomics.ics.uci.edu/
As far as I have understood you have a lot of bacterial proteins,
and you know just their sequences. Do you know the bacterial
host for each of them? You need their domains/ domain prediction, right?
"DOMpro
DOMpro predicts domain locations using a 1D-RNN. DOMpro takes an input the sequence profile, predicted secondary structure, and predicted relative solvent accessiblity. The output of the 1D-RNN is a classification for each residue as being in a domain boundary region or not. The domains are then infered from this output. For a more detailed explanation, see the manuscript in references. "
http://scratch.proteomics.ics.uci.edu/explanation.html#DOMpro
try DOMpro. There is an article mentioned on that site below:
http://download.igb.uci.edu/domain.pdf
DOMpro: Protein Domain Prediction Using Profies, Secondary Structure, Relative Solvent Accessibility,and Recursive Neural Networks