I am using biopython for converting my aligned sequence file from one format to another using "AlignIO.write"
function. The thing I want to know is that,
- When I convert a sequence file given in phylip format to a file of clustal format, the resulted clustal file is of version
CLUSTAL X (1.81)
while I want an output in the form of CLUSTAL W / CLUSTAL 2.1 or so (higher version of clustal). - The resulted clustal file does not include the line in the alignment with symbols (
'*','.',':'
) for the amino acids in the sequence file.
This is the resulted file I am getting.
CLUSTAL X (1.81) multiple sequence alignment
Canis REWSSARPERSKGRRKPVDAAAVSAVQTSQTSSDVAVSSSCRSMEMQDLTSPHSRLSGSS
Mus -------------------------------------------MEMQDLTSPHSRLSGSS
Rattus -------------------------------------------MEMQDLTSPHSRLSGSS
While this is the file format I want to get, including the symbols line indicating the similarity of the residues.
Canis REWSSARPERSKGRRKPVDAAAVSAVQTSQTSSDVAVSSSCRSMEMQDLTSPHSRLSGSS
Mus -------------------------------------------MEMQDLTSPHSRLSGSS
Rattus -------------------------------------------MEMQDLTSPHSRLSGSS
*****************
Can I do this with biopython or I have to use some other method or function??
Hello mdsiddra,
could you please also provide an example of the phylip input file?
fin swimmer
Yes, This is how a phylip file look like:
Hi,
I think seqret from EMBOSS can do the job. You can set the output format as clustal.
I don't want it this way. As I am using python/biopython codes, so I wish to use some source code for this purpose.
What version of Biopython are you on?
Python 3.6 and biopython 1.72