As i am trying to do more work in Multiple sequence alignment using clustalw wrapper in biopython, i am wondering how can i pass parameters through clustalcommandline which i fail to do repeatedly.
Parameters in details like :
- Gap open penalty, Gap extension penalty, no end gap(yes, no), gap distance, weight matrix(blosum, pam etc), type (DNA,protein) and other optional parameters.
Right now i am working on default alignment settings provided by clustalw. As default settings are not fulfilling my interest, i am more keen in adding parameters in given below lines.
import sys, subprocess
from Bio import AlignIO
from Bio.Align.Applications import ClustalwCommandline
cline = ClustalwCommandline("clustalw",
infile="opuntia.fasta")
child = subprocess.call(str(cline),
shell=(sys.platform!="win32"))
Moreover, i will be more pleased if you guys will explain how can i know the inputted fasta file is of nucleotide,DNA or protein.
Thanks for your interest
Also try typing help(c) at the python prompt to find out more about the command line wrapper object you've just created.
Ok let me try and see whether my understanding will work or not. If not then i will again click you. Thanks
you are welcome... I don't want to be silly, but please consider voting up the answers that you find useful, even if you don't want to accept them :-)
Can you answer my below query about inputted fasta file.
I think you should ask that as a separate question. In principle, neither clustalw nor Bio.Align.Applications from Biopython have tools to determine whether a sequence is protein or dna.
Okei i will do that.