Question

How To Do Multiple Whole-Genome Alignments?

4

13.3 years ago

Ning-Yi Shao ▴ 390

The problem I am trying to solve is this: we have three species, say A, B, and C. All of them are primates with reference genomes, and we would like highly accurate multiple alignments of their whole genomes for further study.

I found this UCSC page:

http://genomewiki.ucsc.edu/index.php/Whole_genome_alignment_howto

But it only covers pairwise alignment of two species. I know UCSC provides multiple-genome alignments, but I could not find a methods section describing how they are made.

I also found the list of multiple alignment software on Wikipedia:

http://en.wikipedia.org/wiki/List_of_sequence_alignment_software

But there are so many tools. Which one do I need? Which is the fastest and most accurate?

Any suggestions are welcome, thank you!

multiple evolution genome alignment • 11k views

ADD COMMENT • link updated 11.0 years ago by Biostar 20 • written 13.3 years ago by Ning-Yi Shao ▴ 390

score 11 · Answer 1 · 2011-08-18

11

13.3 years ago

2184687-1231-83- ★ 5.1k

If the three primate genomes you are interested in are in the list of publicly available primate genomes in UCSC/Ensembl/NCBI, you could just use the pre-calculated multiple whole-genome alignments in Ensembl Comparative Genomics:

http://www.ensembl.org/info/docs/compara/analyses.html

The current version is a 6-way primate whole-genome alignment set:

6 primates EPO (method_link_type="EPO" : species_set_name="primates")

- Human (Homo sapiens)
- Chimpanzee (Pan troglodytes)
- Gorilla (Gorilla gorilla)
- Pongo abelii
- Macaque (Macaca mulatta)
- Marmoset (Callithrix jacchus)

They can be downloaded from the ftp site here:

ftp://ftp.ensembl.org/pub/current_emf/ensembl-compara/epo_6_primate/
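If only some of the six species are needed, one option is to project each downloaded block onto those species. The sketch below is illustrative only. It assumes the EMF layout as I understand it from the README that ships with the dumps: one `SEQ` line per sequence, a `DATA` line, then one alignment column per line (one character per sequence, in `SEQ` order), with `//` closing the block. The function names `read_emf_blocks` and `subset_block`, the species labels, and the toy block are all made up for the example, so check them against the real files before relying on this.

```python
from typing import Iterable, Iterator, List, Set, Tuple

Block = Tuple[List[List[str]], List[str]]  # (SEQ fields per sequence, alignment columns)


def read_emf_blocks(lines: Iterable[str]) -> Iterator[Block]:
    """Yield (seq_headers, columns) for each alignment block (assumed EMF layout)."""
    seqs: List[List[str]] = []
    columns: List[str] = []
    in_data = False
    for raw in lines:
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment/metadata lines
        if line.startswith("SEQ "):
            seqs.append(line.split()[1:])  # organism, chr, start, end, strand, ...
        elif line == "DATA":
            in_data = True
        elif line == "//":
            if seqs:
                yield seqs, columns
            seqs, columns, in_data = [], [], False
        elif in_data:
            # First token is the column; any trailing tokens (e.g. scores) are ignored.
            columns.append(line.split()[0])
        # Other pre-DATA lines (TREE, ID, ...) are ignored in this sketch.


def subset_block(seqs: List[List[str]], columns: List[str],
                 wanted: Set[str]) -> List[Tuple[List[str], str]]:
    """Keep only sequences whose organism is in `wanted`; drop all-gap columns."""
    keep = [i for i, fields in enumerate(seqs) if fields[0] in wanted]
    rows = {i: [] for i in keep}
    for col in columns:
        chars = [col[i] for i in keep]
        if all(c == "-" for c in chars):
            continue  # column was only populated by species we removed
        for i, c in zip(keep, chars):
            rows[i].append(c)
    return [(seqs[i], "".join(rows[i])) for i in keep]


# Toy block (hypothetical coordinates) to show the intended usage.
example = """\
SEQ homo_sapiens 1 100 102 1 (chr_length=1000)
SEQ pan_troglodytes 1 200 201 1 (chr_length=1000)
SEQ gorilla_gorilla 1 300 303 1 (chr_length=1000)
SEQ macaca_mulatta 1 400 402 1 (chr_length=1000)
DATA
AAAA
CCCC
--G-
T-TT
//
""".splitlines()

for headers, cols in read_emf_blocks(example):
    for fields, seq in subset_block(headers, cols, {"homo_sapiens", "pan_troglodytes", "macaca_mulatta"}):
        print(fields[0], seq)
```

Note that this only projects an existing 6-way alignment onto three species. It is not the same as realigning the three genomes on their own, and a species may appear more than once in a block.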

ADD COMMENT • link 13.3 years ago by 2184687-1231-83- ★ 5.1k

0

Thank you!

But I only need an alignment of three primates. Is there a more detailed description of the method used to build the alignments? I would like exact whole-genome alignments of hg19, panTro3, and rheMac2.

ADD REPLY • link 13.3 years ago by Ning-Yi Shao ▴ 390

0

I am now looking at the EPO pipeline. Could you give me some more hints about it? For example, should I remove all repeat regions? Thank you very much!

ADD REPLY • link 13.3 years ago by Ning-Yi Shao ▴ 390

score 3 · Answer 2 · 2011-08-18

3

13.3 years ago

Treylathe ▴ 950

If the three species have complete, sequenced genomes, you can get a pre-computed alignment of specific regions for the species of your choice from UCSC: http://screencast.com/t/JCb3UpsfvSf

I'd also check out Vista for pre-computed whole genome alignments. http://pipeline.lbl.gov/cgi-bin/gateway2

If the species you are looking for are incomplete or not included, or you'd like to run the alignment yourself with different parameters or methodology, Multiz is a good start. Vista can also be used to add sequences.
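Multiz reads and writes MAF files, which is also the format UCSC uses to distribute its multiple alignments. As a rough sketch (not part of any of the tools above), here is one way to keep only the blocks that contain all three assemblies from the question and project them onto those rows. It assumes the UCSC convention of naming sources `assembly.chrom` (for example `hg19.chr1`). The names `MafRow`, `read_maf_blocks`, and `restrict_block`, and the toy coordinates, are made up for the example.

```python
from typing import Iterable, Iterator, List, NamedTuple, Optional, Set


class MafRow(NamedTuple):
    src: str       # e.g. "hg19.chr1" (assumed assembly.chrom naming)
    start: int     # zero-based start in the source sequence
    size: int      # number of non-gap bases in this row
    strand: str
    src_size: int  # total length of the source sequence
    text: str      # aligned text, "-" for gaps


def read_maf_blocks(lines: Iterable[str]) -> Iterator[List[MafRow]]:
    """Yield the 's' rows of each alignment block; other line types are skipped."""
    block: List[MafRow] = []
    for raw in lines:
        line = raw.strip()
        if line == "a" or line.startswith("a ") or not line:
            if block:
                yield block
            block = []
        elif line.startswith("s "):
            _, src, start, size, strand, src_size, text = line.split()
            block.append(MafRow(src, int(start), int(size), strand, int(src_size), text))
    if block:
        yield block


def restrict_block(block: List[MafRow], assemblies: Set[str]) -> Optional[List[MafRow]]:
    """Return the block limited to `assemblies`, or None if any of them is missing."""
    kept = [row for row in block if row.src.split(".", 1)[0] in assemblies]
    if {row.src.split(".", 1)[0] for row in kept} != assemblies:
        return None
    width = len(kept[0].text)
    # Drop columns that are gaps in every remaining row.
    cols = [j for j in range(width) if any(row.text[j] != "-" for row in kept)]
    return [row._replace(text="".join(row.text[j] for j in cols)) for row in kept]


# Toy input with hypothetical coordinates.
example = """\
##maf version=1
a score=10
s hg19.chr1 10 4 + 1000 AC-GT
s panTro3.chr1 20 4 + 1000 AC-GT
s rheMac2.chr1 30 3 + 1000 A--GT
s mm9.chr1 40 5 + 1000 ACTGT

a score=5
s hg19.chr2 0 2 + 500 AC
s mm9.chr2 0 2 + 500 AC
""".splitlines()

wanted = {"hg19", "panTro3", "rheMac2"}
for blk in read_maf_blocks(example):
    projected = restrict_block(blk, wanted)
    if projected is not None:
        for row in projected:
            print(row.src, row.text)
```

The `size` field does not change when gap-only columns are dropped, because it counts bases rather than alignment columns.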

ADD COMMENT • link updated 5.2 years ago by Ram 44k • written 13.3 years ago by Treylathe ▴ 950

0

Thank you very much! I am now trying Multiz and the EPO pipeline.

ADD REPLY • link 13.3 years ago by Ning-Yi Shao ▴ 390

score 1 · Answer 3 · 2011-12-23

I've used Mauve (http://gel.ahabs.wisc.edu/mauve/) in the past. If you're aligning relatively complete assemblies, it might be what you're looking for. It also has a contig-reordering feature ("move contigs") that rearranges contigs (from, say, a draft assembly) until their order and orientation maximize alignment with a trusted reference.