Let's say I have a set of short contigs (5 sequences) that I want to assemble into one contig.
I do not want to use multiple sequence alignment; instead, I want to use a template sequence as a guide.
If I align those 5 sequences against the template, I get the following coordinates on the template for each sequence (query ID, subject start, subject end):
blastn -task 'megablast' -query evidence.fa -subject template.fa -outfmt 6 | cut -f1,9,10
piece1 1 234
piece2 56 704
piece3 220 592
piece4 385 704
piece5 510 704
How do I generate a single contig from this? In other words, I want to stitch the contigs by borrowing the information from the template sequence. Is there software for this?
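To make the intended layout concrete, the template coordinates above can be reduced to a tiling path: the smallest left-to-right chain of pieces that covers the aligned part of the template. The following is only a minimal sketch under several assumptions: it uses just the subject start/end values shown above, it ignores the query-side coordinates (columns 7 and 8 of `-outfmt 6`), strand, indels and mismatches, and the names `hits` and `tiling_path` are hypothetical, not part of BLAST or any existing tool.

```python
# Template (subject) coordinates copied from the blastn output above.
hits = [
    ("piece1", 1, 234),
    ("piece2", 56, 704),
    ("piece3", 220, 592),
    ("piece4", 385, 704),
    ("piece5", 510, 704),
]


def tiling_path(hits):
    """Greedy interval cover along the template.

    Walk left to right; among the hits that start inside (or directly
    after) the region covered so far, keep the one reaching furthest right.
    Returns (name, first_new_position, end) tuples, 1-based inclusive.
    """
    ordered = sorted(hits, key=lambda h: (h[1], -h[2]))
    path = []
    covered_to = 0  # last template position covered so far
    i = 0
    while i < len(ordered):
        best = None
        # Candidates that overlap or abut the covered region.
        while i < len(ordered) and ordered[i][1] <= covered_to + 1:
            if best is None or ordered[i][2] > best[2]:
                best = ordered[i]
            i += 1
        if best is None:
            # Coverage gap: restart the chain at the next hit.
            best = ordered[i]
            i += 1
        if best[2] > covered_to:
            path.append((best[0], max(best[1], covered_to + 1), best[2]))
            covered_to = best[2]
    return path


for name, start, end in tiling_path(hits):
    print(name, start, end)
```

For the five hits above, this selects piece1 for template positions 1 to 234 and piece2 for 235 to 704; the other pieces add no further template coverage, although they could still serve as evidence for a consensus. Turning the path into an actual contig would additionally require mapping each template interval back to query coordinates, which this sketch does not attempt.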
Here are the sequences:
>template
CTGACCTCCTGGGAGAGACTATGGATATGATGCTGGGAGAGCCCTGTCAGAGGTAACTATTTGGACATCGGGGAAAAGGGAGACCTCAGGCCTCCATTCAATTTGTATTCAGACTGCTTTCTGGGTAAACTGGACGTTTAGGATTTAGGTGGAAGAACATTTgggatttattaatttattttatataagaaatatacatgtagcatttactatgttcccagttttatttctaagcatttcaAAGATATTAATTCAGTTGAgctaaatgaaagaatactataaaaatatgataagatCAAAATAATCGTAGTCCCAGCAAGCTACTGCTGCTTGCTCAAGTTTGTGAGCCCTGCAGCTGTTCTGTCTGCTGAAGGGAAATGAGCCAACAGCCATGGGGGTGTTAAAGATGTAGCACAAAGCCCAGACTCTCTCAACAACAATCAAGAGgcatgaaaggaaagagaaatggctTTCTCAGATGCCCATCACTGCCATTTGAAAATGTGTCAGTGCATCCTCTAAAATTAACTTCTGTAGAAGCAGAGAAACAAGGCACAGACCTTGGGAAATAAGGGACAACTATGCAGGTACCAGGGGCCAAAGAAGCTTTTTAGCTGAGATTGTGTTTCTTAGAACCAGTAGCGCTGATGAGAAATACCTCTTGTCTGTCTGATTCACCCCCTTCTTGGAGAACCTCCCATTAC
>piece1
CTGACCTCCTGGGAGGGACTATATGGATATGATGCTGGGAGAGCCCTGTCAAGGGACTATTTGGACATCTGGGAAAAGGGAGACCTCAGGCCTCCATTCAATTTGTATTCAGACTGCTTTCTGCGTAAGCTGGGTGTTTAGGATTTAGGTGGAAGAACATTTGGGATTTATTAAATTATTTTATATCAAAAATATACATGTAGCATTTACTATGTTCCCAGTTTTAATTCTAAG
>piece2
AATCAACCAGCATTAATACAGGACTATTTGGACATCTGGGAAAAGGGAGACCTCAGGCCTCCATTCAATTTGTATTCAGACTGCTTTCTGCGTAAGCTGGGTGTTTAGGATTTAGGTGGAAGAACATTTGGGATTTATTAAATTATTTTCTATCAAAAATACACATGTAGCATTTACTATGTTCCCAGTTTTAATTCTAAGCATTTCAAAGATATTAGTTCACTTGGGCTAAATGAAAGAATACTATAAAAGTATGATTGGATCAAAATAATCGTGGTCCCAGCAAGCTTGTGCTGCCTGCTCAAGTTTGTGAGCCCTGCAGCTGTTCTGTCTGCTGAAGGGAAGTGAGCCAACAGCCATGGGGGTGTTAAAGATGTAGCACAGAGCCCAGACACTCTCAACGACAATCAAGAGGCAGGAAAGGAAAGAGAAACAGCTTTCTCAGATGCCCATCACTGCCATTTGAAAATGTGTCTGTGCATCCTCTAAAATTAACTCCTGTAGAAGCAGAGAAGCAAGGCACAGGCATTGGGAAATAAGGGACAACTATGCAGGTACCAGGGGCCAAAGAAGCTTTTTAGCTGAGATTGTGTTTCTTAGAACCAGTAGCACTGATGACAAATTCCCTCTTGTCTGTCTGATTCACCCCTTTCTTGAAGAACCTCTCATTACCCACAGGTCCTGCTAACTGGGCTGTGGGGACATTTGATCCAATCAAGTTCAATTAATCAGTTTCCCTCTTATGGGAATCTGGAATAAAGACATCTTG
>piece3
AGTTTTAATTCTAAGCATTTCAAAGATATTAGTTCACTTGGGCTAAATGAAAGAATACTATAAAAGTATGATTGGATCAAAATAATCGTGGTCCCAGCAAGCTTGTGCTGCCTGCTCAAGTTTGTGAGCCCTGCAGCTGTTCTGTCTGCTGAAGGGAAGTGAGCCAACAGCCATGGGGGTGTTAAAGATGTAGCACAGAGCCCAGACACTCTCAACGACAATCAAGAGGCAGGAAAGGAAAGAGAAACAGCTTTCTCAGATGCCCGTCACTGCCATTTGAAAATGTGTCCGTGTATCCTCTAAAATTAATTCCTGTAGAAGAAGAGAAAGAAGGCACAGGCCTTGGGAAATAAGGGACAACTATGCAGGTACC
>piece4
TGGGGTGTTAAAGAACAGCCATGGGGGTGTTAAAGATGTAGCACAGAGCCCAGACACTCTCAACGACAATCAAGAGGCAGGAAAGGAAAGAGAAACAGCTTTCTCAGATGCCCGTCACTGCCATTTGAAAATGTGTCCGTGTATCCTCTAAAATTAATTCCTGTAGAAGAAGAGAAAGAAGGCACAGGCCTTGGGAAATAAGGGACAACTATGCAGGTACCAGGGGCCAAAGAAGCTTTTTAGCTGAGATTGTGTTTCTTAGAACCAGTAGCACTGATGACAAATTCCCTCTTGTCTGTCTGATTCACCCCTTTCTTGAAGAACCTCTCATTACCCACAGGTCCTGCTAACTGGGCTGTGGGGACATTTGATCCAATCAAGTTCAATTAATCAGTTTCCCTCTTATGGGAATCTGGAATAAAGACATCTTG
>piece5
CGTGTATCCTCTAAAATTAATTCCTGTAGAAGAAGAGAAAGAAGGCACAGGCCTTGGGAAATAAGGGACAACTATGCAGGTACCAGGGGCCAAAGAAGCTTTTTAGCTGAGATTGTGTTTCTTAGAACCAGTAGCACTGATGACAAATACCTTCTTGTCTGATTCACCTCTTTCTTGAAGAACCTCCCATTACCCACAGGTCCTGCTAACTGGGCTGTGGGGACATTTGACCCAATCAAGTTCAATTAATC
I am trying to avoid multiple alignment because of the following scenario. Suppose I have two contigs, where the last 10% of one contig matches the first 10% of the other. Multiple alignment would fail to stitch the two contigs....
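As a toy illustration of that end-to-start scenario, the kind of join wanted here is a suffix-prefix merge. This is a minimal sketch, assuming an exact (mismatch-free) overlap on the same strand, which real contigs will rarely satisfy; `longest_overlap`, `stitch` and the example strings are hypothetical and made up for illustration.

```python
def longest_overlap(a, b, min_len=3):
    """Length of the longest suffix of `a` that exactly equals a prefix of `b`.

    Returns 0 if no overlap of at least `min_len` bases exists.
    """
    for k in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:k]):
            return k
    return 0


def stitch(a, b, min_len=3):
    """Merge `a` and `b` across their exact suffix-prefix overlap, or return None."""
    k = longest_overlap(a, b, min_len)
    if k == 0:
        return None
    # Keep all of `a`, then append only the non-overlapping tail of `b`.
    return a + b[k:]


left = "ACGTACGGTTCA"
right = "TTCAGGCTAAGC"
print(stitch(left, right))
```

With a template available, the overlap length would not need to be searched for like this; it could instead be read off from where the two pieces' template intervals intersect.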