kmer analysis for genome size estimation
1
0
Entering edit mode
7 months ago
rj.rezwan ▴ 10

Hi, I used the PacBio HiFi reads in Merqury/1.9 for kmer analysis of our allotetraploid species. I have shared the figure of genomescope plot, which showing the size of 1.5gb. I am astonished what's wrong here. Can someone guide me in this? For more information, the diploid species has a genome size 1.5 gb. I shall be grateful to you. First, I assembled the genome using hifiasm with size of ~3.7gb.

Second question is why the observed peaks are going out of the model peak?

enter image description here

kmer merqury allotetraploid pacbio hifiasm • 664 views
ADD COMMENT
0
Entering edit mode

Isn't this size estimate a good fit? I am not sure, but is Ploidy taken into account at all for this? You can also see that dup:4.11 which would fit with tetraploidy. Also, even if there was very high heterozygosity in the genome, this would at best add a fraction of a percent more unique k-mers.

ADD REPLY
0
Entering edit mode

yes, this thing want to ask why this does not give a length near to my assembly size? and please look at the observed peak (blue), is this normal to be out of the model peak (black)?

ADD REPLY
0
Entering edit mode
7 months ago

The oh-know materials have all kinds of example plots for various genome cases or extra settings for cases where the size estimation didn't work out. In your case, looks a little bit like the allotetraploid example plot? https://github.com/KamilSJaron/oh-know/wiki/Allotetraploid

ADD COMMENT
0
Entering edit mode

yes, genome is allotetraploid

ADD REPLY

Login before adding your answer.

Traffic: 2406 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6