I want to do some realignment of a segment of the genome that show conservation between different species (human, zebrafish, mouse, rat,etc). I know the genomic coordinate of the human region and if I just view the region on human in UCSC genome browser, I can see the Multiz sequence alignment track. What I need is a way to extract the sequences that contribute to the sequence alignment in the view range. I looked around but couldn't find a function to do this. I'm wondering what's the traditional way of doing this?
I can think of using liftover to find corresponding syntenic regions of the human sequence and do this for each species. Is this the right way to go? Are there some tools out there that can do it more efficiently?
Thanks!
thanks! I'm not familiar with Ensembl browser, where can I access and download the alignments on the browser?
Go to ensembl.org, select the species e.g. human and enter the region you are interested in (note the format should be like this 17:63973115-64437414) in one of the search boxes. Hit 'Go'. You are now in the Location tab. In the left hand side menu there is the link called 'Alignments (text)'. Click on it, select the alignment (e.g. 17 eutherian mammals EPO) and then click Go. Once you select which block alignment to view, you then click on 'Download the alignment' button and select the format you wish to export it (e.g. FAST, etc...). More details on our 'Alignments (text)' can be found in our help page. More details on the comparative genomics aspect of Ensembl can be found in this tutorial video. More details on the Download feature are depicted in our blog post.
Hi Denise, Can you please tell me how do this using perl API. I am trying to use the code from compara tutorials, but I do not want pairwise alignments , as extracted by this
fetch_by_method_link_type_GenomeDBs('LASTZ_NET')
. I want the alignments in multiple species , for example for 17 eutherian mammals EPO. How to do this using API. I cannot find the method name to use for this instead oflastz_net
. Thanks Sid