Stockholm Format From A Msa + Structure?
3
3
Entering edit mode
13.5 years ago

Has anyone worked with Stockholm Format? Particularly is there any tools out there to generate a stockholm formatted file including the predicted secondary structure?

Ralee comes with scripts that help convert clustal formatted files but I cannot work out how to automate adding RNA secondary structure (or from what package it should be generated).

format secondary alignment • 6.5k views
ADD COMMENT
7
Entering edit mode
13.4 years ago
Asaf 10k

I've just implemented this feature in my biopython fork: https://github.com/asafpr/biopython/tree/alignSS

I took the RNA package from krother and added a wrapper to RNAalifold. If you'll install my fork you could just write:

from Bio import AlignIO
from Bio import RNA
msa = AlignIO.read(file,format)
ss = RNA.predictStruct(msa,'RNAalifold')
msa.setSS(ss)
AlignIO.write(msa,file,'stockholm')

Of course you'll have to have RNAalifold in your path hope it will help

ADD COMMENT
2
Entering edit mode
13.5 years ago

The Vienna secondary structure package predicts secondary structures for RNAs, and they will output the ss line at the bottom of the stockholm format (http://www.tbi.univie.ac.at/~ivo/RNA/).

All the models in RFAM contain an SS line, and they can be downloaded in bulk from their ftp site (http://rfam.sanger.ac.uk).

If you have the multiple alignment on one side and the SS line on the other, I haven't found a way to combine them either, other than scripting it myself by adding the lines at the end of the stockholm file.

ADD COMMENT
1
Entering edit mode

thanks but how to combine the secondary structure from RNAfold with a multiple sequence alignment to form a stockholm formated file? For my workflow I need a program/script for this: I feel this must have been done before.

ADD REPLY
0
Entering edit mode

No automated method that I know

ADD REPLY
0
Entering edit mode
13.3 years ago
DG 7.3k

I use stockholm format quite a bit as the output from doing alignments with FSA or HMMER3 (confidence values for columns), it is handy for letting you mark up sequences. If you have any scripting experience it is trivial to output valid stockholm format files. A basic file looks a lot like phylip format, just with special mark-up lines that contain data other than sequence.

ADD COMMENT

Login before adding your answer.

Traffic: 1837 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6