Genome sequencing project opinions & help - heterozygous species
1
0
Entering edit mode
7.4 years ago
lessismore ★ 1.4k

Hi all,

as here there are notably many experts in this field i am curious to have your opinion about a delicate and arguable topic.

According to the current technologies, what kind of sequencing (technology/platform, read length and coverage) do you suggest for efficiently sequencing and, hopefully assembly, the genome of a plant species of about 1,2 Gb and highly heterozygous ?

Thanks for your help

sequencing next-gen Assembly genome • 1.2k views
ADD COMMENT
1
Entering edit mode
7.4 years ago
shwethacm ▴ 240

Hi there,

To begin with, highly heterozygous genomes are not the easiest to assemble using genome sequencing technologies and assemblers that we have today. Be prepared to get a rather fragmented assembly with low-moderate N50.

I would suggest a combination of short-read (Illumina) and long-read (PacBio) technologies for your genome. Most of the recent large-genome assemblies have adopted this combination and it has worked well in almost all cases. Build the skeleton genome assembly using long reads, and use the short read data to "fill-in" and polish your assembly.

If you only want to use short-reads, Redundans is an assembler that works with short-read data and is specialized for highly heterozygous genomes. You could plan your assembly project using short reads alone with this assembler in mind. Plan your sequencing depth and insert sizes according to the recommended values so you have the right 'type' of data before you assemble.

Also, this might be of interest to you: This preprint that just came out describes the sequencing ans assembly of the wheat genome, which is highly heterozygous and massive (>10GB genome size). http://www.biorxiv.org/content/early/2017/07/03/159111

Good luck!

ADD COMMENT

Login before adding your answer.

Traffic: 2122 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6