For an organism which does not have a reference genome. How does one understand which contig of the assembled strands map to chromosome-N.
For an organism which does not have a reference genome. How does one understand which contig of the assembled strands map to chromosome-N.
Basically, what you're asking is how to scaffold the assembled contigs. To do this, you need some further information. Note that this isn't simple, and your contigs likely have many problems which makes this difficult.
If it's important, you can check your scaffolding using PCR by designing primers around the edges of the gap between contigs.
Scaffolding is great, but it doesn't get you a chromosome assignment. If you want to be able to order and orient scaffolds to build a chromosome representation you will need an independent map source. That is, you need markers (SNPs, STSs, genes, etc) that have been mapped to chromosomes using a sequence independent method (linkage mapping, RH mapping, FISH mapping). If you can find the same markers in your scaffolds then you can start ordering and orienting the scaffolds along the chromosomes. Early maps such as this (http://www.ncbi.nlm.nih.gov/pubmed/9149939) were critical to ordering and this (http://www.ncbi.nlm.nih.gov/pubmed/16843097) were critical for ordering and orienting human scaffold data to produce the first chromosome assemblies.
I don't think there are any NGS assemblers that will do this out of the box. This is likely software you (or someone) would have to write after you had performed your scaffolding. Depending on the size of your scaffolds/genomes and the quality of maps you might even be able to do this manually (but I doubt it would be fun). Look at the paper describing the human draft assembly.
It appears that Hi-C sequencing can be used to efficiently group/arrange contigs on chromosome level:
http://www.nature.com/nbt/journal/v31/n12/full/nbt.2764.html?WT.ec_id=NBT-201312
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Has anything changed in the last 3 years?
Any new tools, methods, pipelines or workflows?
Well longer reads such as PacBio or NanoPore have certainly helped a lot of scaffolds, but people still use genetic maps to assign contigs to chromosomes. Population-wide sequencing has become easier and cheaper for that, have a look at the relatively recent POPSEQ