Hi,
I have a DNA sequence contains a tandem duplication (the sequence of duplication is unknown) for an unknown times.
Each subsequence has some mutations, suppose repeat unit is A (length is 1~8K ) then the sequence is A1A2A3....An(A1 and An possibly are not complete).
If I want to find out the duplication unit A, how should I do?
Thanks.
If you are interested in finding exact number/sequence of the repeats then using a long-read sequencing technology would be the ideal way.
If you already have sequence data then please specify the kind.
Yes, I have some rolling amplified nanopore sequencing data and want to separate the repeated unit then generating a consensus single reads for a better accuracy.
You should have included this information in the original post.
Even if
ReDTandem
is not supported you may be able to use it. If it is not meant to be used with long reads then that would be an exception.Are you using a custom protocol to do this sequencing. What do you mean by "rolling amplified" data? AFAIK there is no rolling circle like sequencing in Nanopore as there is in PacBio.
Thanks,
ReDTandem can not be downloaded anymore since the author closed his website
“Rolling amplified” means the target DNA is amplified several time to be a tandem sequence, then separate these tandem copies after nanopore sequencing, I could take a consensus step to increase the target DNA sequence accuracy.
Hi,
actually, what I exactly wanted is a massive sequencing reads of rolling amplification sequence, is there any bioinformatic tools for is propose?
I find one named ReDTandem but the author have not supported it anymore.
Thanks