2) The error rate is ultimately Illumina's error rate. The synthetic long reads are just assemblies of many Illumina short reads generated from the same long-ish molecule, so the error rate should typically be much lower than that of a single Illumina read... barring misassemblies.
4) It's not really relevant for de novo assembly because it still can't (in general) resolve repeats longer than the read length. For assembly it's no better than shotgun sequencing (according to people at my lab who experimented with it), but a lot more expensive.
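A toy sketch of the repeat problem mentioned above (purely illustrative, with made-up sequences): two genomes that differ only in how unique segments are arranged between copies of a repeat yield exactly the same set of reads once the reads are shorter than the repeat, so no assembler can tell the two arrangements apart from those reads alone.

```python
# Toy illustration (not real data): reads shorter than a repeat cannot
# distinguish different arrangements of the segments between its copies.

def reads(genome, read_len):
    """All error-free reads tiled at every position (idealised shotgun data)."""
    return {genome[i:i + read_len] for i in range(len(genome) - read_len + 1)}

# Unique flanking segments (11 bp each) and one repeat (21 bp).
A, B, C, D = "ACGTACGTGGA", "TTGACCATGCA", "GGCATTACAGT", "CCTAGGATACC"
R = "AAATTTCCCGGGAAATTTCCC"

genome_1 = A + R + B + R + C + R + D
genome_2 = A + R + C + R + B + R + D   # B and C swapped between repeat copies

SHORT_READ = 15   # shorter than the repeat
LONG_READ = 50    # longer than the repeat plus its flanks

print(reads(genome_1, SHORT_READ) == reads(genome_2, SHORT_READ))  # True: indistinguishable
print(reads(genome_1, LONG_READ) == reads(genome_2, LONG_READ))    # False: resolvable
```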
We have found it to be useful for resolving genic regions, and it can resolve repetitive regions as long as they are not tandem repeats. But we have also found that it appears to be significantly biased for specific genomic regions. We never really saw obvious problems with misassemblies.
I'm interested to see how 10x Genomics data pans out, which uses a similar strategy (batches of assemblies) but scaled up using emulsion PCR. They just released their own assembler, Supernova.
Good point - it should theoretically be able to resolve repeats outside of the "long read", just not inside of it. So, for example, it should be better at assembling ribosomal sequences, which are often present in many copies, but are not tandem repeats.
Individual reads are not 10 kb long; no current Illumina sequencer can produce reads that long. That is the starting length of the DNA that goes into these libraries.
Libraries can be created using as low as 500 ng starting DNA (info from the PDF application note on the page you linked above).
We have seen 'reads' (i.e. assembled fragments) up to 10 kb and sometimes longer, but they typically average more like 6-8 kb if you have a decent library prep.
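If you want to check this on your own data, here is a minimal sketch (the FASTA filename is hypothetical) that computes the count, mean length, and N50 of the assembled synthetic long reads:

```python
# Minimal sketch: length statistics of assembled synthetic long reads
# from a FASTA file (filename is a placeholder, adjust to your data).

def fasta_lengths(path):
    """Yield the sequence length of each record in a FASTA file."""
    length = 0
    with open(path) as handle:
        for line in handle:
            if line.startswith(">"):
                if length:
                    yield length
                length = 0
            else:
                length += len(line.strip())
    if length:
        yield length

lengths = sorted(fasta_lengths("synthetic_long_reads.fasta"), reverse=True)
total = sum(lengths)

# N50: length of the read at which the cumulative sum reaches half the total.
running, n50 = 0, 0
for l in lengths:
    running += l
    if running >= total / 2:
        n50 = l
        break

print(f"reads: {len(lengths)}")
print(f"mean length: {total / len(lengths):.0f} bp")
print(f"N50: {n50} bp")
```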
So Illumina performs an assembly first and gives us the long "read".
What is the advantage of their method? Do they guarantee long fragments? I mean, we could perform that assembly ourselves.
The advantage is that the long read assembly happens on a smaller scale. If you are doing regular Illumina sequencing, each read can come from anywhere in the genome. With the synthetic long reads, each well only has a few ~10kb fragments. Therefore, each read from that well should assemble into those fragments. You are assembling a small part of a genome as opposed to a whole genome. You could extract the individual short reads and assemble them yourself. In fact, that is what you get by default. You then have to use BaseSpace to do the long read assembly that will generate those synthetic long reads.
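To make the "smaller scale" point concrete, here is a minimal sketch of the idea (not Illumina's actual pipeline): bin the short reads by the well they came from and assemble each tiny bin on its own. How the well identity is encoded in the read names is an assumption here, and the filenames are hypothetical; real data and the BaseSpace app will differ.

```python
# Minimal sketch of per-well binning before local assembly.
# Assumption: the well barcode is the last '#'-separated field of the read
# name, and reads are in a gzipped FASTQ called run_R1.fastq.gz (both made up).

import gzip
from collections import defaultdict

def fastq_records(path):
    """Yield (header, sequence, plus, quality) tuples from a gzipped FASTQ."""
    with gzip.open(path, "rt") as handle:
        while True:
            header = handle.readline().rstrip()
            if not header:
                return
            seq = handle.readline().rstrip()
            plus = handle.readline().rstrip()
            qual = handle.readline().rstrip()
            yield header, seq, plus, qual

def well_of(header):
    """Hypothetical: take the well barcode from the last '#' field of the name."""
    return header.split("#")[-1]

# Bin reads by well; each bin only contains reads from a few ~10 kb fragments,
# so each bin is a small, local assembly problem rather than a whole genome.
bins = defaultdict(list)
for record in fastq_records("run_R1.fastq.gz"):
    bins[well_of(record[0])].append(record)

for well, records in bins.items():
    with open(f"well_{well}_R1.fastq", "w") as out:
        for header, seq, plus, qual in records:
            out.write(f"{header}\n{seq}\n{plus}\n{qual}\n")
    # Each per-well FASTQ can then be assembled independently, e.g.:
    #   spades.py -s well_<well>_R1.fastq -o asm_<well>
```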
Another advantage is that you don't need additional instrumentation if you are an Illumina facility. Since it uses Illumina technology, it's also cheaper than PacBio or Nanopore (a major concern if your genome is over 100 Mb).
"Libraries can be created using as low as 500 ng starting DNA (info from the PDF application note on the page you linked above)."
That is true, but the DNA has to be of very good quality. My group has done this a few times and we had to restart with new DNA every time because it turned out that it was not good enough despite fulfilling the official requirements.
It's also an extremely laborious process compared to other library preps.
Thanks for your answer.