Hi! I need your help. I'm working on a ML model and I need to compare a wt protein sequence to a list of 3000 mutated sequences. every mt sequence contains a single point mutation. the aim is to find every mutation and create two columns, one for the mutation position and the other for the type of mutation (aa symbol). Can you help me, please?
You can do this easily with a multiple sequence alignment tool and the
AlignIO
module of biopython.Biostars is not a code writing service though. Judging by your tags you already know where to start, so please have a go and we will gladly help troubleshoot.