How Does Assembled Contigs Get Mapped To A Chromosome?
3
0
Entering edit mode
12.2 years ago
Hranjeev ★ 1.5k

For an organism which does not have a reference genome. How does one understand which contig of the assembled strands map to chromosome-N.

next-gen chromosome • 16k views
ADD COMMENT
0
Entering edit mode

Has anything changed in the last 3 years?

Any new tools, methods, pipelines or workflows?

ADD REPLY
1
Entering edit mode

Well longer reads such as PacBio or NanoPore have certainly helped a lot of scaffolds, but people still use genetic maps to assign contigs to chromosomes. Population-wide sequencing has become easier and cheaper for that, have a look at the relatively recent POPSEQ

ADD REPLY
9
Entering edit mode
12.2 years ago
Ketil 4.1k

Basically, what you're asking is how to scaffold the assembled contigs. To do this, you need some further information. Note that this isn't simple, and your contigs likely have many problems which makes this difficult.

  1. The obvious one is using more sequence data, either paired-end (short inserts) or mate-pair 2nd gen reads (longer distances), or fosmid/cosmid/BAC ends (typically sequenced using Sanger). I'm using RNAseq for this, which seems an obvious thing to do, but I'm not sure it's very common.
  2. You can use a related genome, and map your contigs to that. This will only be reliable to the extent the genomes are closely related.
  3. You can use gene synteny - certain genes tend to occur in a certain order. Again, this depends on how close the other organisms are.
  4. If you have SNP information, you can create a genetic map from linkage groups, this is often successful in grouping contigs by chromosomes.

If it's important, you can check your scaffolding using PCR by designing primers around the edges of the gap between contigs.

ADD COMMENT
0
Entering edit mode

I'm interested in your RNASeq approach. How is it done?

ADD REPLY
1
Entering edit mode

Only a prototype so far, but I'm just using RNA evidence to order and orient contigs. Of course, distances are not very precise in the case of introns.

ADD REPLY
2
Entering edit mode
12.2 years ago
deanna.church ★ 1.1k

Scaffolding is great, but it doesn't get you a chromosome assignment. If you want to be able to order and orient scaffolds to build a chromosome representation you will need an independent map source. That is, you need markers (SNPs, STSs, genes, etc) that have been mapped to chromosomes using a sequence independent method (linkage mapping, RH mapping, FISH mapping). If you can find the same markers in your scaffolds then you can start ordering and orienting the scaffolds along the chromosomes. Early maps such as this (http://www.ncbi.nlm.nih.gov/pubmed/9149939) were critical to ordering and this (http://www.ncbi.nlm.nih.gov/pubmed/16843097) were critical for ordering and orienting human scaffold data to produce the first chromosome assemblies.

ADD COMMENT
0
Entering edit mode

Thanks for your answer. How are these used in NGS platforms. If you know any papers please do direct me to them. TQ again.

ADD REPLY
1
Entering edit mode

I don't think there are any NGS assemblers that will do this out of the box. This is likely software you (or someone) would have to write after you had performed your scaffolding. Depending on the size of your scaffolds/genomes and the quality of maps you might even be able to do this manually (but I doubt it would be fun). Look at the paper describing the human draft assembly.

ADD REPLY
1
Entering edit mode
9.3 years ago
thackl ★ 3.0k

It appears that Hi-C sequencing can be used to efficiently group/arrange contigs on chromosome level:

http://www.nature.com/nbt/journal/v31/n12/full/nbt.2764.html?WT.ec_id=NBT-201312

ADD COMMENT

Login before adding your answer.

Traffic: 1772 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6