Problem: I have a multiple alignment of sequence reads. Each individual is represented by two sequences, forward and reverse (e.g. individual1_For
and individual1_Rev
) that don't overlap at the ends. I want my alignment to have one sequence per individual that is the consensus of the forward and reverse sequences. NB I don't want to create a consensus across all individuals. I want to recognise sequences that share a name e.g. "individual1" and replace them with a full-length consensus of those two sequences.
Does anybody know of existing script or programs to do this? If it doesn't already exist I shall probably start putting something together with BioPerl, any suggestions of best approach?
what is the main reason to do this? I mean you are going to use the output for? I ask this because you can use a logo to display the consensus.
what does it means to calculate the consensus between the forward and reversed sequence? Could you make an example of your input and desired output?