Hello,
Is there a simple way of extracting all sequences for a specific domains from hmmer vs Pfam results ?
I have used esl-reformt to output the fasta containing the sequence for each domain found in my input, however it doesn't specify the domain of each sequence. Many domains were found in the original multi fasta file, but I just want the sequences corresponding to one to be extracted.
Thanks in advance for any advice,
Emily
not sure what or how you did it but if you run your proteins through InterPro you will get per input protein a list of all it matching domains.
From the output file you can then either grep on the a protein ID to get all its domains or grep a domainID to get all proteins that have that specific domain