Clustering program based on NJ phylogeny
7.2 years ago
kgbenn123 ▴ 20

Does anyone have a recommendation for a clustering program that uses neighbor-joining (NJ) phylogeny to cluster a dataset of protein sequences and output a representative from each cluster? I'm thinking of something that works like CD-HIT, but uses NJ instead of sequence identity for clustering.

Any suggestions?

clustering protein NJ Phylogeny • 1.8k views

Phylip doesn't appear to have the function I'm looking for. I'm working with about 6000 seqs and want a program that can cluster through neighbor joining and output either a list of accessions that represent their respective clusters or some file format that allows me to extract them.

5.8 years ago
gbl1 ▴ 80

Even if this is late: NJ (neighbour-joining) clustering is pretty easy to do in R with the nj() function.
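For readers who want to see the whole workflow the question asks for (NJ tree, then clusters, then one representative per cluster), here is a minimal Python sketch. It is not the R `nj()` function and not an existing tool. The rule of cutting every branch longer than `max_branch`, the medoid as representative, the helper names, and the toy distances are all assumptions made for illustration. It expects at least two uniquely named sequences and a precomputed pairwise distance for every pair, and this pure-Python version is far too slow for ~6000 sequences.

```python
from itertools import combinations


def neighbor_joining(names, dist):
    """Build an unrooted NJ tree; return edges as (node_a, node_b, length).

    names: unique leaf labels; dist: dict frozenset({a, b}) -> distance.
    """
    d = dict(dist)
    nodes = list(names)
    edges = []
    next_id = 0
    while len(nodes) > 2:
        n = len(nodes)
        # Row sums used by the Q criterion.
        r = {a: sum(d[frozenset((a, b))] for b in nodes if b != a) for a in nodes}
        # Join the pair minimising Q(i, j) = (n - 2) * d(i, j) - r(i) - r(j).
        i, j = min(combinations(nodes, 2),
                   key=lambda p: (n - 2) * d[frozenset(p)] - r[p[0]] - r[p[1]])
        dij = d[frozenset((i, j))]
        li = 0.5 * dij + (r[i] - r[j]) / (2 * (n - 2))
        lj = dij - li
        u = f"_node{next_id}"  # hypothetical label for the new internal node
        next_id += 1
        # Simplification: negative branch lengths are clamped to zero.
        edges += [(u, i, max(li, 0.0)), (u, j, max(lj, 0.0))]
        for k in nodes:
            if k not in (i, j):
                d[frozenset((u, k))] = 0.5 * (
                    d[frozenset((i, k))] + d[frozenset((j, k))] - dij)
        nodes = [k for k in nodes if k not in (i, j)] + [u]
    a, b = nodes
    edges.append((a, b, max(d[frozenset((a, b))], 0.0)))
    return edges


def cluster_leaves(names, edges, max_branch):
    """Cut branches longer than max_branch; leaves still connected share a cluster."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b, length in edges:
        if length <= max_branch:
            parent[find(a)] = find(b)
    clusters = {}
    for leaf in names:
        clusters.setdefault(find(leaf), []).append(leaf)
    return list(clusters.values())


def representatives(clusters, dist):
    """Pick each cluster's medoid (smallest total distance to the other members).

    Ties are broken by input order.
    """
    return [min(members,
                key=lambda m: sum(dist[frozenset((m, o))] for o in members if o != m))
            for members in clusters]


# Toy distances (made up for illustration): A, B, C are close; D, E are close.
names = ["A", "B", "C", "D", "E"]
pairs = {("A", "B"): 0.2, ("A", "C"): 0.25, ("B", "C"): 0.25,
         ("A", "D"): 1.25, ("A", "E"): 1.35, ("B", "D"): 1.25,
         ("B", "E"): 1.35, ("C", "D"): 1.2, ("C", "E"): 1.3, ("D", "E"): 0.3}
dist = {frozenset(p): v for p, v in pairs.items()}

edges = neighbor_joining(names, dist)
clusters = cluster_leaves(names, edges, max_branch=0.5)
reps = representatives(clusters, dist)
print(clusters, reps)
```

With real data the distances would come from an alignment, and `max_branch` would need tuning; other ways of cutting the tree (for example by patristic distance within a subtree) are equally reasonable.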
