Hi all, I am very new to bioinformatics and have just started using Biopython. Is there a way to extract part of each sequence from a large set of protein sequences, based on where a domain lies? I have sequences for ~500 proteins and I know the location of the domain in each one, but I need the sequence for just that domain plus about 50 residues on either side so I can do an alignment. The solution does not need to be in Biopython. I would really appreciate some help. Thank you.
Yes, this makes sense. Thank you. However, the domain does not occupy the same location in each protein. Sometimes it lies in residues 50-100, while in others it may be in 100-150, and so on.
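One way to handle a different domain location per protein is to keep a table of start and end positions keyed by sequence ID and slice each sequence with its own coordinates. Below is a minimal sketch in plain Python, not a definitive solution. It assumes the coordinates are 1-based and inclusive (as in "residues 50-100"), and that the flank should simply be cut short when the domain sits near either end of the protein. The function names (`read_fasta`, `extract_domain`, `extract_all`) and the toy data are made up for illustration.

```python
def read_fasta(lines):
    """Yield (identifier, sequence) pairs from an iterable of FASTA lines."""
    name, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            # Use the first word of the header as the identifier.
            name, chunks = line[1:].split()[0], []
        else:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def extract_domain(seq, start, end, flank=50):
    """Return the domain plus up to `flank` residues on each side.

    `start` and `end` are assumed to be 1-based, inclusive positions.
    """
    left = max(start - 1 - flank, 0)    # clamp at the N-terminus
    right = min(end + flank, len(seq))  # clamp at the C-terminus
    return seq[left:right]


def extract_all(records, domains, flank=50):
    """Map each ID that has known coordinates to its domain plus flanks."""
    regions = {}
    for name, seq in records:
        if name in domains:  # skip proteins without coordinates
            start, end = domains[name]
            regions[name] = extract_domain(seq, start, end, flank)
    return regions


# Hypothetical toy input; real data would come from a FASTA file and a
# coordinate table (for example a CSV with id, start, end columns).
toy_fasta = [">protA", "ACDEFGHIKL", "MNPQRSTVWY",
             ">protB", "ACDEFGHIKLMNPQRSTVWY"]
toy_domains = {"protA": (8, 12), "protB": (2, 4)}

# A small flank is used here only because the toy sequences are short.
for name, region in extract_all(read_fasta(toy_fasta), toy_domains, flank=3).items():
    print(f">{name}\n{region}")
```

For real data, an open file handle can be passed to `read_fasta` in place of the list, and the printed FASTA output can be redirected to a file for the alignment. With Biopython, `SeqIO.parse(handle, "fasta")` could replace `read_fasta`, using each record's `id` and `str(record.seq)`.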