Question

How Do Assembled Contigs Get Mapped To A Chromosome?

0

12.2 years ago

Hranjeev ★ 1.5k

For an organism that does not have a reference genome, how does one determine which of the assembled contigs map to chromosome N?

next-gen chromosome • 16k views

ADD COMMENT • link updated 2.1 years ago by Ram 44k • written 12.2 years ago by Hranjeev ★ 1.5k

0

Has anything changed in the last 3 years?

Any new tools, methods, pipelines or workflows?

ADD REPLY • link updated 2.1 years ago by Ram 44k • written 9.3 years ago by Ric ▴ 440

1

Well, longer reads such as PacBio or Nanopore have certainly helped a lot with scaffolding, but people still use genetic maps to assign contigs to chromosomes. Population-wide sequencing has become easier and cheaper for that; have a look at the relatively recent POPSEQ.

ADD REPLY • link updated 2.1 years ago by Ram 44k • written 9.3 years ago by Philipp Bayer 8.7k

Ram · Answer 1 · 2012-09-20

Basically, what you're asking is how to scaffold the assembled contigs. To do this, you need some further information. Note that this isn't simple, and your contigs likely have many problems that make this difficult.

The obvious one is using more sequence data, either paired-end (short inserts) or mate-pair 2nd gen reads (longer distances), or fosmid/cosmid/BAC ends (typically sequenced using Sanger). I'm using RNAseq for this, which seems an obvious thing to do, but I'm not sure it's very common.
You can use a related genome, and map your contigs to that. This will only be reliable to the extent the genomes are closely related.
You can use gene synteny - certain genes tend to occur in a certain order. Again, this depends on how close the other organisms are.
If you have SNP information, you can create a genetic map from linkage groups; this is often successful in grouping contigs by chromosome.
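To make the related-genome option concrete, here is a minimal sketch of assigning each contig to the reference sequence it aligns to best. It assumes you have already aligned your contigs to the related genome and have the result in PAF format (for example as written by minimap2). The function name, the `min_mapq` cutoff, and the example records are made up for illustration, and a real analysis would also want to look at alignment coverage and rearrangements.

```python
from collections import defaultdict

def assign_contigs_from_paf(paf_lines, min_mapq=20):
    """Return {contig: (best_reference_sequence, fraction_of_matching_bases)}.

    Uses the standard PAF columns: 1 = query name, 6 = target name,
    10 = number of matching bases, 12 = mapping quality.
    """
    matched = defaultdict(lambda: defaultdict(int))
    for line in paf_lines:
        if not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        contig, ref = fields[0], fields[5]
        n_match, mapq = int(fields[9]), int(fields[11])
        if mapq < min_mapq:  # hypothetical cutoff, tune for your data
            continue
        matched[contig][ref] += n_match

    result = {}
    for contig, per_ref in matched.items():
        best = max(per_ref, key=per_ref.get)
        result[contig] = (best, per_ref[best] / sum(per_ref.values()))
    return result

# Invented PAF records, for illustration only.
example = [
    "ctg1\t5000\t0\t4000\t+\tchr2\t90000\t100\t4100\t3900\t4000\t60",
    "ctg1\t5000\t4000\t5000\t-\tchr7\t80000\t500\t1500\t100\t1000\t30",
    "ctg2\t3000\t0\t3000\t+\tchr7\t80000\t9000\t12000\t2950\t3000\t60",
    "ctg2\t3000\t0\t500\t+\tchr1\t70000\t10\t510\t400\t500\t5",
]
assignments = assign_contigs_from_paf(example)
for contig, (ref, frac) in sorted(assignments.items()):
    print(contig, ref, round(frac, 3))
```

A low fraction for the winning reference sequence is a hint that the contig is chimeric or that the two genomes differ in that region, so those assignments deserve a second look.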

If it's important, you can check your scaffolding using PCR by designing primers around the edges of the gap between contigs.
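As a starting point for the PCR check, the sketch below pulls out the sequence on either side of each gap in a scaffold so that primers can be designed in those flanks. It assumes gaps are written as runs of `N`, which is a common convention but not guaranteed for every scaffolder. The function name and the `flank` size are placeholders, and actual primer design (melting temperature, specificity) is left to a dedicated tool.

```python
import re

def gap_flanks(scaffold_seq, flank=200, min_gap=1):
    """Find runs of N and return the sequence flanking each gap."""
    flanks = []
    for m in re.finditer(r"[Nn]{%d,}" % min_gap, scaffold_seq):
        start, end = m.start(), m.end()
        flanks.append({
            "gap_start": start,  # 0-based, inclusive
            "gap_end": end,      # 0-based, exclusive
            "left": scaffold_seq[max(0, start - flank):start],
            "right": scaffold_seq[end:end + flank],
        })
    return flanks

# Toy scaffold: two 10 bp contigs joined by a 5 bp gap.
scaffold = "ACGTACGTAC" + "N" * 5 + "TTGCATTGCA"
flanks = gap_flanks(scaffold, flank=6)
for f in flanks:
    print(f["gap_start"], f["gap_end"], f["left"], f["right"])
```

The forward primer would sit in the left flank and the reverse primer in the right flank (as its reverse complement). A product of roughly the expected size supports the join.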

score 2 · Answer 2 · 2012-09-26

2

12.2 years ago

deanna.church ★ 1.1k

Scaffolding is great, but it doesn't get you a chromosome assignment. If you want to be able to order and orient scaffolds to build a chromosome representation, you will need an independent map source. That is, you need markers (SNPs, STSs, genes, etc.) that have been mapped to chromosomes using a sequence-independent method (linkage mapping, RH mapping, FISH mapping). If you can find the same markers in your scaffolds, then you can start ordering and orienting the scaffolds along the chromosomes. Early maps such as this one (http://www.ncbi.nlm.nih.gov/pubmed/9149939) were critical for ordering human scaffold data, and this one (http://www.ncbi.nlm.nih.gov/pubmed/16843097) was critical for both ordering and orienting it, to produce the first chromosome assemblies.
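The marker-based placement described above can be sketched roughly as follows. This is only an illustration under simple assumptions: the genetic map is a dictionary of marker to (chromosome, cM position), marker hits on scaffolds are (marker, scaffold, bp position) tuples, each scaffold goes to the chromosome most of its markers vote for, scaffolds are ordered by their median cM, and orientation is taken from whether cM rises or falls between the first and last marker along the scaffold. All names and data here are hypothetical; real maps are noisy and need more careful conflict handling.

```python
from collections import Counter, defaultdict
from statistics import median

def anchor_scaffolds(genetic_map, marker_hits):
    """Return {chromosome: [(scaffold, orientation), ...]} in map order."""
    by_scaffold = defaultdict(list)
    for marker, scaffold, pos in marker_hits:
        if marker in genetic_map:
            chrom, cm = genetic_map[marker]
            by_scaffold[scaffold].append((chrom, cm, pos))

    placed = defaultdict(list)
    for scaffold, hits in by_scaffold.items():
        # Majority vote decides the chromosome.
        chrom, _ = Counter(c for c, _, _ in hits).most_common(1)[0]
        # Markers on that chromosome, sorted by position on the scaffold.
        on_chrom = sorted((pos, cm) for c, cm, pos in hits if c == chrom)
        cms = [cm for _, cm in on_chrom]
        if len(cms) >= 2 and cms[0] != cms[-1]:
            orientation = "+" if cms[-1] > cms[0] else "-"
        else:
            orientation = "?"  # one marker cannot orient a scaffold
        placed[chrom].append((median(cms), scaffold, orientation))

    return {chrom: [(s, o) for _, s, o in sorted(items)]
            for chrom, items in placed.items()}

# Invented map and hits, for illustration only.
genetic_map = {
    "m1": ("chr1", 2.0), "m2": ("chr1", 5.0),
    "m3": ("chr1", 20.0), "m4": ("chr1", 26.0),
    "m5": ("chr2", 3.0), "m6": ("chr2", 9.0),
}
marker_hits = [
    ("m3", "scafA", 1000), ("m4", "scafA", 9000),
    ("m2", "scafB", 500), ("m1", "scafB", 7000), ("m5", "scafB", 3000),
    ("m6", "scafC", 100),
]
layout = anchor_scaffolds(genetic_map, marker_hits)
print(layout)
```

Note that `scafB` carries one marker from a different chromosome. Here it is simply outvoted, but in practice conflicting markers are worth inspecting because they can point to a misassembly.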

0

Thanks for your answer. How are these used with NGS platforms? If you know of any papers, please do direct me to them. Thanks again.

ADD REPLY • link 12.2 years ago by Hranjeev ★ 1.5k

1

I don't think there are any NGS assemblers that will do this out of the box. This is likely software you (or someone) would have to write after you had performed your scaffolding. Depending on the size of your scaffolds/genomes and the quality of maps you might even be able to do this manually (but I doubt it would be fun). Look at the paper describing the human draft assembly.

ADD REPLY • link 12.1 years ago by deanna.church ★ 1.1k

score 1 · Answer 3 · 2015-08-21

1

9.3 years ago

thackl ★ 3.0k

It appears that Hi-C sequencing can be used to efficiently group and arrange contigs at the chromosome level:

http://www.nature.com/nbt/journal/v31/n12/full/nbt.2764.html?WT.ec_id=NBT-201312
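To give a feel for why Hi-C helps with grouping, here is a toy sketch. It relies on the general observation that contigs from the same chromosome share far more Hi-C contacts than contigs from different chromosomes. It is not the method of the linked paper: it simply joins any two contigs whose length-normalised link count exceeds a threshold (single-linkage grouping with union-find). The function name, the normalisation by the product of contig lengths, and the threshold are all assumptions for illustration.

```python
def group_by_hic(contig_lengths, link_counts, min_density):
    """Group contigs whose normalised Hi-C link density passes a threshold."""
    parent = {c: c for c in contig_lengths}

    def find(c):
        # Union-find lookup with path halving.
        while parent[c] != c:
            parent[c] = parent[parent[c]]
            c = parent[c]
        return c

    for (a, b), n_links in link_counts.items():
        density = n_links / (contig_lengths[a] * contig_lengths[b])
        if density >= min_density:
            parent[find(a)] = find(b)

    groups = {}
    for c in contig_lengths:
        groups.setdefault(find(c), set()).add(c)
    return sorted(sorted(g) for g in groups.values())

# Invented lengths (bp) and read-pair link counts, for illustration only.
contig_lengths = {"c1": 1000, "c2": 2000, "c3": 1000, "c4": 1500}
link_counts = {("c1", "c2"): 400, ("c3", "c4"): 300, ("c2", "c3"): 4}
groups = group_by_hic(contig_lengths, link_counts, min_density=1e-5)
print(groups)
```

Each resulting group is a candidate chromosome. Ordering and orienting contigs within a group needs the distance dependence of the contact signal, which this sketch does not attempt.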