how do I assemble centromere regions in a plant genome?
1
0
Entering edit mode
12 months ago
evyk • 0

I'm assembling a new genome for a plant de novo. I already have the scaffolds and most of the assembly but the parts in the centromere regions won't align against the reference sequence i have of the plant. This reference sequence is not of good quality. Is there a tool to get a good assembly of the centromere regions or an other way to get the centromere regions right?

novo de assembly • 554 views
ADD COMMENT
0
Entering edit mode

I think the short answer is "you don't".

ADD REPLY
2
Entering edit mode
12 months ago

This is hard. Use very long reads eg Nanopore, and also Pacbio. Use a very high coverage (the human T2T project used 170X of both if I remember rightly). Employ centromere experts and a large multinational team with extensive funding.

You can see this isn't trivial right ?

Why don't you start by using a telomere finder like tidk https://github.com/tolkit/telomeric-identifier to discover if the telomeres are all present ?

Also, you'll need to give details. Is this a small plant <1 GB, or a massive polyploid, or something inbetween ?

I haven't found a good tool to detect centromeres yet, but would be interested in your approach.

ADD COMMENT

Login before adding your answer.

Traffic: 2357 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6