Dear all,
I am trying out kmergenie to determine optimal kmer values for plant genome assembly. Using the default settings I get a nice histogram for all the different k-mers, but using the --diploid parameter the histogram is truncated. See .dat outputs below.
It seems that many of the k-mer histograms do not have any associated model fits (this is apparent in the html output (not shown here). Do you know what may be going wrong?
Many thanks for any advice!
Robin
## Default (haploid) model
$ kmergenie filelist.txt -k 85 -t 8 -o kmergenie:
k genomic.kmers cov.cutoff
15 135570989 1
25 373492340 1
35 425306591 1
45 460648430 1
55 480548886 1
59 487082292 1
61 486570123 1
63 486719928 1
65 488075925 1
67 485561924 1
69 484863969 1
71 483620710 1
75 468859001 1
85 1932760 22
## Diploid model
$ kmergenie filelist.txt --diploid -k 85 -t 8 -o kmergeniediploid #note that estimates for k15, k45 and k>57 are missing
k genomic.kmers cov.cutoff
25 349086825 1
35 390717507 1
51 425391521 1
53 425679437 1
55 426942015 1
57 426544332 1
Can you please send both HTML reports to kmergenie@cse.psu.edu?
The diploid model is more constrained, so it has higher chance to not fit to an histogram, as opposed to the haploid model, that is less constrained.