What is a good tool to count the number of k-mers (with k being > 30), where we say that the reverse complement of a k-mer is the same as the k-mer itself? In other words, I would want ACCT and AGGT to count as the same k-mer. Does tallymer do this?
Thanks.
Well, I do want to count both strands, but more importantly, I want to make sure that ACCT and AGGT is counted once and not twice. It doesn't appear that Jellyfish does that, but I could be mistaken.
@Bob. If you use the
-C
flag, then one occurrence of ACCT and one occurrence of AGGT would count as two occurrences of a ACCT/AGGT kmer. The output file will just show ACCT or AGGT with the count, but I'm not sure how it decides which to use.