Hi,
I am using mafft aligner for multiple sequences alignment by command line. I am taking output in clustalw format, but the problem is that sequence identifiers are longer than 14-15 characters for every sequences (I have to keep them as they are) and mafft returns only till 14th characters for each ids and I want complete identifiers name in output file. Is there any option to get the full id in mafft output or something I missed in tutorial ???
Thanks in advance !!
Use a different output format. A number of formats have a hard limit on ID characters. Clustal format is similar to strict phylip which has a hard limit of 16 characters.
I’d advise switching to an aligned fasta or something. Almost every tool can accept the latter.
Hi, Thanks for reply!
I need a output in clustal format only.. What should I do ??
Can you not simply edit the output file, which should be plain text (?)? You could use a regex to alter the sequence identifiers via, for example,
sed
. Make it so that it matches beginning of line (^
)Hi, Thanks for your suggestion. I used ClustalO and it solved my issue.
Thankyou so much !!!