I want to align 2 sequences locally (Smith Waterman algorithm) ==> the output will be several alignments some of them unlike others is significant. What I want to do is to test the significance by a randomization/permutation test. And below is my approach:
- Align S1 to S2
- ==> obtain for example 10 alignments (Ai) longer then some threshold
- for each Ai test if significant :
- Do a permutation of S1 and S2 and align those permutations
- We obtain a score for each alignment
- Calculate the probability of those alignments that have a score > the initial score obtained by the initial alignment Question : how to continue ? how to decide if the alignment is significant? And are these probabilities obtained considered as p-values ? (my statistical knowledge is humble) Thanks in advance
1- But the query sequence could be found in more than one region in the reference sequence, thus I don't only need the alignment with the highest score, I need all the alignment whose length is above some threshhold 3- How to slide the the two sequences ? can you please give a small example ?
The standard SW only gives a single best alignment. There are variants that can give you multiple, but they are not SW and I have not seen them used in practice. You should just pick up an aligner. Forget about SW.