clustering sequences in two different assemblies to find similarities based on sequence
1
0
Entering edit mode
7.7 years ago
Mehmet ▴ 820

Dear all,

I have two assembly, and I want to cluster of each sequence in two assemblies to find how similar the two assembly in terms of sequence. I used cd-hit-est-2d. I also want to use another tool for this. Any recommendation?

Thank you

gene sequence alignment • 1.7k views
ADD COMMENT
0
Entering edit mode

Nucmer or mummer is just what you are lookingfor (I think)

ADD REPLY
0
Entering edit mode
7.7 years ago

The other strategy I would recommed is to translate and get the peptides in all frames and do a protein level clustering. You could use latest tools like MMseq, https://github.com/soedinglab/MMseqs2 to do the clustering .

ADD COMMENT

Login before adding your answer.

Traffic: 1706 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6