Entering edit mode
7.2 years ago
DL
▴
50
Hello everyone,
I am going to do genome assembly for that i did kmer analysis to see that which kmer value is good for the assembly. genome is heterozygous so i found two peaks in kmer analysis. i performed kmer analysis using kmer value 21, 32 and 64 now i am confused which kmer value is best for assembly. At kmer value=64, i got large genome size. Please explain which parameter i will see for good assembly.
Please Reply as soon as possible
Thanks in advance
From my perspective, everything in your figure says to me that the best assembled genome out of the three is the one with kmer=64. I looked at each statistic and kmer=64 has the best result for each, including the narrowest gap between min and max. A larger k-mer is ideal as it essentially provides for more 'uniqueness' in the base sequences that will form the assembled genome. The downside to using a large k-mer is the computational expense associated with it, as the resulting de Bruijn graph that is generated using such a larger k-mer would be complex.
Some questions for you:
Thank You Kevin for your reply.
i have one doubt. what should i do if the genome is heterozygous ?? for that 63 kmer is ohk ???
For that, I cannot comment - apologies. I just know that by the standard measures, the k-mer=63 genome looks better.
You may find this thread of use: Recommendations For Heterozygous Genome Assembly Software
I also found this recent (2014) publication: https://www.ncbi.nlm.nih.gov/pubmed/24755901
Apologies that I cannot assist further!