I'm trying to align a string of twenty 5-mers, separated by gaps of 100 to 400 bp, to a reference genome. It's similar to paired-end alignment, except that N=20 instead of N=2. The order of the twenty 5-mers is known.
Is there any software that can handle a data set like this?
Best regards,
-Sam
Yes, it's similar to Complete Genomics. Unfortunately, they haven't released their aligner, and I haven't found any aligner that can work with their raw data. Thanks for the PacBio tip. Their 'short' reads are ~700 bp, so it looks like they just use BLAST, which won't quite work for my tiny 5 bp reads.
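
In case it helps anyone with the same kind of data, here is a rough sketch of how the ordering and gap constraints could be checked directly. It is not an existing tool, just an illustration. Assumptions: exact matches only, a single strand, the gap is measured from the end of one 5-mer to the start of the next, and the names `index_kmers` and `chain_starts` are made up for this example. Since a given 5-mer is expected about once per 4^5 = 1024 bp in random sequence, individual hits say very little, so all of the signal has to come from the chain as a whole.

```python
from bisect import bisect_left
from collections import defaultdict


def index_kmers(reference, k=5):
    """Map each k-mer to the sorted list of its start positions."""
    index = defaultdict(list)
    for i in range(len(reference) - k + 1):
        index[reference[i:i + k]].append(i)  # i increases, so lists stay sorted
    return index


def chain_starts(reference, kmers, min_gap=100, max_gap=400):
    """Return start positions of kmers[0] from which the whole ordered
    chain can be placed with every gap inside [min_gap, max_gap].

    Assumes exact matches and that all k-mers have the same length.
    """
    k = len(kmers[0])
    index = index_kmers(reference, k)

    # Work backwards: `feasible` holds the sorted positions of the current
    # k-mer from which the rest of the chain can still be completed.
    feasible = index.get(kmers[-1], [])
    for kmer in reversed(kmers[:-1]):
        current = []
        for pos in index.get(kmer, []):
            lo = pos + k + min_gap   # earliest allowed start of next k-mer
            hi = pos + k + max_gap   # latest allowed start of next k-mer
            j = bisect_left(feasible, lo)
            if j < len(feasible) and feasible[j] <= hi:
                current.append(pos)
        feasible = current
    return feasible


if __name__ == "__main__":
    # Toy reference: three 5-mers separated by 100 bp and 150 bp of filler.
    ref = "ACGTA" + "T" * 100 + "GGCCA" + "T" * 150 + "CATGC"
    print(chain_starts(ref, ["ACGTA", "GGCCA", "CATGC"]))
```

On a real genome the in-memory index would need to be replaced with something more compact, and mismatches or the reverse strand would have to be handled separately; this only shows the chaining logic.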