I want to generate a multiple sequence alignment that retains the annotations I made on a list of protein sequences (eg. Stockholm file format). Is there a way to do this? Maybe with Biopython or Biostrings?
In the end, all I want is a MSA with the same annotations as the orignal sequences, so any way that I can add annotations to a MSA without annotations will work as well.
My files are currently in genbank format, is there a way to transfer them to fasta while retaining the annotations so that they are compatible with MUSCLE?
Also, these are annotations I made on domains in the sequence as opposed to the sequence as a whole
My sequences are currrently in Geneious. So I have an annotation table, but I can only export the file with annotations to genbank
The old version of
esl-reformat
issreformat
, and it can convert GenBank to FASTa.You will need to go to older HMMer version, I think
v2.3
, to find this program.If this doesn't work, Google is your friend. There should be plenty of programs or scripts to convert GenBank to FASTa.