Entering edit mode
6.2 years ago
gtrwst9
•
0
Say I have the file for accession LS483306.1 which is one big sequence starting with
>LS483306.1 xyz
AGCT...
and want to have one file with N sequences of size X, looking like this (example X=2000):
>LS483306.1:1-2000 xyz
AGCT...
>LS483306.1:2001-4000 xyz
GCTA...
>LS483306.1:4001-6000 xyz
CTGA...
and so on.
Is there a ready-made command-line tool for this? Which? I could write a BioPython script but I would like something faster.
then write it in plain python