Hi everyone.
I want to find particular functions in my genome so I tried this out : downloading a database from pfam (in my case http://pfam.xfam.org/family/PF00483#tabview=tab3) and then blast it against my genome.
It works if I download the unaligned sequences so I got the entire proteins. My problem is that I want a better specificity so I need the result of the blast against the domain only.
Do you know if it's possible using blastp to take into account dots or dashes ?
(I considered replacing the dots with a letter like X for example but the problem is that the e-value and bit-score might not be interpretable anymore)
Thank you very much, Sofia
It sounds like the input is genomic sequences data in that case, are you employing BLASTX like approach to search against the specific protein database that you have downloaded? Have you explored other options such as HMMER?
if you have proteins you can use hmmer tool with pfam profile file to find out which protein domains your proteins belong to.
I did use hmmer but it doesn't have the option to look through one family domain only. Then I get as result a lot of data I have to go though to analyse for only one function that I am interested in
ok, can you elaborate on your specific research question then? If you only input on pfam domain profile you will only get hits/info for that one, but we might have the wrong idea of what you want to achieve.