How to quickly find the position of k-mer on the genome
1
0
Entering edit mode
3.7 years ago
slbai00001 • 0

Hi, all

How to quickly find the position of k-mers on the genome? Currently, I use software such as bwa and fuzznucl, but I feel it is not fast enough.

alignment sequencing k-mer • 1.9k views
ADD COMMENT
0
Entering edit mode

Are you looking to find one specific k-mer or a map of all k-mers?

ADD REPLY
0
Entering edit mode

In fact, I first used kmc to obtain all the specific k-mers of one genome relative to the other genome, and then I wanted to obtain the position of these k-mers on the genome. The number of k-mers was about several million or even a few giga.

ADD REPLY
0
Entering edit mode

Yes. Can I ask for your penultimate goal? Unless you're going to write a new assembler, I'm not sure what you would need that map for.

ADD REPLY
0
Entering edit mode

Did you find an efficient solution? I would like to perform the same task.

ADD REPLY
0
Entering edit mode
3.7 years ago

Something similar has been asked here. In essence, you can use motif finding tools such as fuzznuc from the EMBOSS suite, BLAT with a small tile size and options -minMatch and -minIdentity, or seqkit locate, depending on your input/output format requirement.

But yes, as you already noticed, it will be slow because there are lots of k-mers. You could also format your k-mer list in fastq/fasta format and use BWA with zero mismatches allowed.

ADD COMMENT

Login before adding your answer.

Traffic: 2729 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6