Entering edit mode
8.2 years ago
Ric
▴
440
Hi,
Abyss 2 introduces a new Bloom filter assembly mode that enables large genome assemblies with minimal memory. Bloom filter assembly mode is enabled by adding three 'abyss-pe' parameters
B
(Bloom filter size), H
(number of Bloom filter hash functions), and kc
(minimum k-mer count threshold).
How do I determine these parameters?
Thank you in advance.
Mic
Probably this post might help -
abyss/konnector2 : Usage Scenario
The kmer count depends on your coverage too, I usually calculate that with bbnorm. For bloom filter size, Ben suggests 40G for 80X human data, so depends on how much data you have now. I have no idea about the number of hash functions. How about running at different values and check how well the FPR decreases from the log file. Not optimal, but practical I guess.