Hello all,
I am completely new to this kind of task. I have data like this:
>in0
GATCCTCGAAGTTACACGGG
>in1
TACGTCGACGTCAATCCGGG
>in2
TACACGGGCCGCTCCTGGGC
>in3
ACGGGGTACTACGAGACGCG
>in4
AGGGGGAATGTGGTCCACAT
>in5
TCCACATGGCTTGCTCCTGA
>in6
CTTGACGTTATGAATTTCGC
and so on. I need to assemble these short reads, and I want to use Perl for this. I just need pseudocode for how to do it, or a pointer to a good resource. At the end I need a single string containing the consensus sequence.
Is there a reason you want to reinvent the assembly wheel? A good number of assemblers have already been written, so why write yet another one without a good reason?
Can you give me some examples, so that I can look them up directly on the internet?
You could try Google.
As orange said, SOAPdenovo is one option. Others include Trinity and Minia. You will find quite a few more if you search PubMed for "DNA assembler" or "DNA assembly".
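For the pseudocode asked for in the original question, here is a rough sketch of one simple approach, greedy overlap merging: repeatedly find the two reads with the longest suffix/prefix overlap and join them. It is written in Python rather than Perl, but it uses only string operations and should port over almost line for line. The function names and the min_len parameter are made up for illustration, and the sketch assumes error-free reads that are all on the same strand; it does not handle sequencing errors, reverse complements, or repeats, which is a large part of why the existing assemblers are usually the better choice for real data.

```python
def overlap(a, b, min_len=5):
    """Length of the longest suffix of a that is also a prefix of b.

    Returns 0 if the best overlap is shorter than min_len
    (min_len is an illustrative threshold, not a recommended value).
    """
    for k in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:k]):
            return k
    return 0


def greedy_assemble(reads, min_len=5):
    """Greedily merge the best-overlapping pair until no overlaps remain.

    Returns a list of contigs; it has a single element only if all
    reads could be chained together.
    """
    # Drop exact duplicates while keeping the original order.
    contigs = list(dict.fromkeys(reads))
    while len(contigs) > 1:
        best_k, best_i, best_j = 0, None, None
        # Compare every ordered pair (quadratic, fine for small inputs).
        for i, a in enumerate(contigs):
            for j, b in enumerate(contigs):
                if i == j:
                    continue
                k = overlap(a, b, min_len)
                if k > best_k:
                    best_k, best_i, best_j = k, i, j
        if best_k == 0:
            break  # nothing left to merge
        # Join the pair, writing the shared bases only once.
        merged = contigs[best_i] + contigs[best_j][best_k:]
        contigs = [c for idx, c in enumerate(contigs)
                   if idx not in (best_i, best_j)]
        contigs.append(merged)
    return contigs


if __name__ == "__main__":
    reads = ["GATCCTCGAAGTTACACGGG", "TACACGGGCCGCTCCTGGGC"]  # in0 and in2
    print(greedy_assemble(reads))
```

In the sample data, in0 ends with TACACGGG and in2 starts with it, so those two reads merge into one 32-base contig. Reads with no overlap of at least min_len bases are left as separate contigs, so a single final string is only possible when the reads really do chain together.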