Analyse diversity in nucleotide sequences of one gene

0

Entering edit mode

6.7 years ago

khhgng ▴ 70

I have a fasta format file with sequences of one gene from 1000 Arabidopsis accessions.

What would be the best way to cluster and visualise these sequences on basis of SNPs among them ?

Many thanks

SNP PCA Phylogenetic tree • 1.2k views

ADD COMMENT • link updated 6.6 years ago by Biostar 20 • written 6.7 years ago by khhgng ▴ 70

0

Entering edit mode

Multiple sequence alignment using t-coffee, MAFFT, muscle.

ADD REPLY • link 6.7 years ago by GenoMax 147k

0

Entering edit mode

Thanks. So in that case what's a better approach for making a tree considering very high sequence similarity - Neighbour joining or parsimony or ...?

ADD REPLY • link 6.7 years ago by khhgng ▴ 70

0

Entering edit mode

Depends on what you want to do. If you only want to visualize the SNP's then you don't need to make a tree.

ADD REPLY • link 6.7 years ago by GenoMax 147k

0

Entering edit mode

I would rather want to cluster them based on SNPs. That may bring more sense to the evolutionary part regarding this one locus.

ADD REPLY • link 6.7 years ago by khhgng ▴ 70

0

Entering edit mode

And that may automatically happen. Run the MSA and then you could edit it (within reason) to show what you want to demonstrate.

ADD REPLY • link 6.7 years ago by GenoMax 147k

0

Entering edit mode

If your alignment is good/close it probably won't make a difference what method you use.

ADD REPLY • link 6.6 years ago by Joe 21k

Login before adding your answer.