Question

How to align long primary transcript sequences?

0

Entering edit mode

9.7 years ago

patricia_icbc • 0

Hello,

Does anyone know how to do pairwise alignment of very long primary transcript(with long introns, e.g. 6k) sequences?

Will lastz/chain pipeline work? If so, could anyone please tell me how?

I have been bothered by this question for a long time and really appreciate any of your help.

Tricia

alignment • 2.5k views

ADD COMMENT • link updated 2.5 years ago by Ram 44k • written 9.7 years ago by patricia_icbc • 0

Ram · Answer 1 · 2015-03-17

1

Entering edit mode

9.7 years ago

thackl ★ 3.0k

If you want to align unspliced transcript vs. unspliced transcript, try exonerate (exonerate --model genome2genome0. It models introns on both, query and target sequence.

ADD COMMENT • link updated 2.5 years ago by Ram 44k • written 9.7 years ago by thackl ★ 3.0k

0

Entering edit mode

Thanks, thackl.

I tried to use exonerate with --model genome2genome, but the output is just alignment of some fragments. Can exonerate generate an alignment with two sequences aligned from end to end?

ADD REPLY • link updated 2.5 years ago by Ram 44k • written 9.7 years ago by patricia_icbc • 0

0

Entering edit mode

I was actually hoping that genome2genome would do this But you are right, as a second look at the available models in the manual confirmed - genome2genome is not a global alignment mode.

Exonerate can run global alignments, e.g. --model affine:global, however, this mode does not model introns...

There might be a few ways to improve your result (not sure though, if this gets you far enough...)

1) use genome2genome, but try to prevent exonerate from splitting the alignment. Have a look at --refine, --exhaustive and Intron Modeling Options

2) use affine:global but maybe with adjusted --gapextend costs to allow for long gaps (and also use--exhaustive and --refine)

ADD REPLY • link 9.7 years ago by thackl ★ 3.0k

0

Entering edit mode

It would also help to know a little bit more about your sequences. Are you aligning transcripts from different organisms, or different paralogs, or ..? And are you expecting minor differences, primarily in introns or in exons, or are you looking for structural differences?

ADD REPLY • link 9.7 years ago by thackl ★ 3.0k

Ram · Answer 2 · 2015-03-20

0

Entering edit mode

9.7 years ago

Prakki Rama ★ 2.7k

You can also try mapping using GMAP if you align to genome.

ADD COMMENT • link updated 2.5 years ago by Ram 44k • written 9.7 years ago by Prakki Rama ★ 2.7k

0

Entering edit mode

Thanks, Prakki.

Cound GMAP align genomic sequences?

ADD REPLY • link 9.7 years ago by patricia_icbc • 0

0

Entering edit mode

It seems it aligns only cDNA and ests. Check the link I provided for more details.