Hi there,
I have 80 DNA samples in which we have sequences three overlapping sections of a large gene using the sanger method. I am now looking for a way to merge these sequenced segments of the gene into a single sequence for each sample so that they can be aligned for analysis.
I have come across a couple of tools for doing this with just two overlapping sequences (such as emboss), and i've seen that this can be done with bioedit for one sample at a time, but is there a tool that can allow me to do this in bulk. or will i have to align and assemble them as i would with ngs data of a genome?
Thankful for any answers.
Kind Regards, Tom
You should give
tadpole.sh
from BBMap a try. It should work with fasta formatted sequences.This would be trivially simple if you had access to Sequencher, DNASTAR, ContigExpress from Vector NTI among others (Note: these are all commercial software packages and are not free). Consed suite will work as well but it will require signing an academic agreement and some effort on your part to install everything.