How to find Bloom filter parameter for Abyss 2
0
0
Entering edit mode
8.2 years ago
Ric ▴ 440

Hi, Abyss 2 introduces a new Bloom filter assembly mode that enables large genome assemblies with minimal memory. Bloom filter assembly mode is enabled by adding three 'abyss-pe' parameters B (Bloom filter size), H (number of Bloom filter hash functions), and kc (minimum k-mer count threshold).

How do I determine these parameters?

Thank you in advance.

Mic

Assembly genome bloom filter • 2.7k views
ADD COMMENT
0
Entering edit mode

Probably this post might help -

abyss/konnector2 : Usage Scenario

The kmer count depends on your coverage too, I usually calculate that with bbnorm. For bloom filter size, Ben suggests 40G for 80X human data, so depends on how much data you have now. I have no idea about the number of hash functions. How about running at different values and check how well the FPR decreases from the log file. Not optimal, but practical I guess.

ADD REPLY

Login before adding your answer.

Traffic: 1877 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6