Hi there,
Here is a long DNA sequence (in fasta), would you like to show me how to split it into certain length fragments (100nt) with 20nt overlapping? Like following:
Input:
>E.coli
ACTG*****************************
Output:
>E.coli(1-100)
ACTG***********************
>E.coli(80-180)
*******************************
>E.coli(160-260)
*******************************
Thank you in advance!
Would you like to tell us whether you tried to do this yourself, or if you don't know where to start with the problem?
A script, like JC's answer. Thank u also.