I have 20 nucleotide sequences of a particular gene. I collected these sequences from 20 different strains of a virus. My target is to separate them into different groups based on similarity. Finally, I will find out a consensus sequence from each group. How I can do this???
By doing a multiple sequence alignment. You can use a local program like
MEGA/clustal
or an online web interface e.g. clustal omega.As far as I know, through MSA I will get only one consensus sequence for all 20 nucleotide sequences. But, I want to group the sequence first (maybe 3 or 4 groups), the most similar sequence will be put together and one consensus seq for each group. Actually I'm very new to Bioinformatics, maybe I am wrong.
If you know which sequences are more homologous to each other, you could separate them before doing individual MSA's.
If you don't have an idea, then try doing an initial MSA with all sequences (since you said they are from a particular gene they should be reasonably homologous to do that alignment). Examine the results of the alignment (plot a distance tree) and then decide on the groups you want to break the sequences into before doing individual alignments to get a consensus.