vsearch: understanding unique query sequences / align ALL
0
0
Entering edit mode
2.7 years ago
mkk2152 • 0

I am pretty new to sequence alignment issues.

I am using vsearch --usearch_global (v2.21.1) to align short sequences (in fasta file) to a reference. The message I'm getting is:

Matching unique query sequences: 8016 of 11169 (71.77%)

I'm not 100% sure how it determines the number of unique query sequences, because there are no strictly identical sequences there.

I didn't find a straightforward answer to this either in the paper or in the manual - perhaps because the procedure is so standard? Based on experimenting with the --wordlength and --minwordmatches parameters, I'm guessing that it encodes each sequence into words, and if two sequences have the same encoding, it will only align the first one. Is this true?

And is there a sure fire way to force vsearch to align ALL the sequences?

alignment vsearch • 281 views
ADD COMMENT

Login before adding your answer.

Traffic: 1664 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6