Splitting a FASTA into pieces of a desired nucleotide length
5.1 years ago
baurumon ▴ 30

Hello,

I want to split this FASTA sequence into pieces of a fixed nucleotide length, in my case 2500 bp each. To be clearer: if a FASTA file contains a 5934 bp sequence, I want to split it into two FASTA files of 2500 bp each, with the remaining 934 bp in a third FASTA file.

I have tried the pyfasta program, but it does not split the sequence this way.

Please suggest a way to solve this issue.

Thanks in advance!

   >NC_1342:::::: TCCGCCAGTAGAGAATCGACCTGTGGGACTGATCATTGGAGTCATCGTTGGTCTGTCAGCTGTGGTGATTTTTGTGGCCTGTCTCATCTATAAAAATCCCGAAAAAAGACAAAACATATTTCTTGTTCCATCTTGTCACGAACGGATACGACATTGTTTCTGTAGACGAGCAGGGGCTCATGTAACATCGGCAGTAATATAATTTCCCTTCCCCTTCCCCTTTCATCCCCCTTCTTGAATTGCCACCTTATCGTGGTGGAGGGGTTTGTGAGCTCAGACGATCCTGGGAGCTGAGTTGTCTGGAGCGTTCAGCTCCTGGTAGGGTCTCCCATGACAAACAGTTCCCAGGTGATGAGCCAGACTAAGAGCAATTCAAAAAAACATTGTCCTGTTTGTACCAATCATGTGGACAACAGCTAAATCTGATTTAGCGACTTAGTGTTACCTAAGACATGATTGCGTATTATAAAAACCTGTGATAATGAAAGGACAATATGTAACTGTTAAATGTAATAAAATGAGTTAGACTTGAAACACGAGACCACAGAGTTCTGTGTGAACACACCAACATGTTGGCTGGTGTGGACCAGGTAAAGACAACATGTACTCATACAAATATCAAAGATGAGTAGCTCAATTACATTTTATGCAAAGGACAAGTACTCCAGCTGGCTTCTGTTGTTACCTGGTGTTACTTAAATATAATTGTATTGACAGTATTTGAATTTACTGACACAACTTCTTTTTTTACTGTTTACACGTCAAATCTGTATCAGGTATCCTGTAAATTCAGCTGTCATAATAACAGCACTGAAGGTAAAACAGCTTTCTATAGACTGGGACACTTCAGGCTAAATTATGTGTTCACTGCATCTAAAAAACAATAATGATGCTAATGGTTATTACTGTGTTTATTATCAGGTGATTTGTATGTTTATGCCATTAAAAGATGGAATCAAAATACTTCTAATGGCATTTATTGTACAGTTTTATTTATGTTTTATTGTTGCTGTGCACTGAAAGCAGCAGTTTGGGAATCAGATTGTTTAAAATGAACTTGATGCAAAAGTGTTTTTCTCTTCAGTACTTTGTGTGATCCTGATACTGACTGTGATTAGTGTGACTGGAGCCATGGTTGGTTTTCTGTCCTGCAGAACTGTTAATAAAAGCTGTTTGTTGCATGTTTGGTCTGTGTGTCTGGTCTCTCTGGTATCAGTAACTCAGATTATCAGTGGTTAACCCTTTGGCTGCTGGTTATTGGTGCTATTCTATCACACCAGAGAATACGAGGGAGAGAGAACCAGGATGTTTATACTGCAGTTGAATAGAAAGATGACACATCCAGGAAGTGCAGGGAAACATGAGGAAGATATAGATTTTAATGAAGGTGAGCTAAGAGACTTTATGAACTATAAGACCTCTTAACCAGTGAATCCACGGTTATTTAGTTTGACTAAAACTAGATGTGGGGAAAAAACCCAGAACATTCTGGTTAAAATACAAACTAAACTATTGGGGGGAAAAAATAAAAGTGATCTGTTTTCTTTCTTTTTTTAACTTCAAAAATTATTATTAACTACAATAGATATACAGAGGAAGTTAATTTTATTACCCTAAATTTTAAAAAAAAATCTGGAATTTTGAAAGTTAAGACTTTCAAATCACATGAGCCAAAAAAATTTTCACAACAGAAAATAAATAACAAATGTTAGAACTGAGAAATTTTACAATCCAATCATTTCAAATTTGATGCCTAATGTAAGTCTCAAAAATGTTCTTTTAGACAGGCAGAGACCCTCAAAAGACACTTTTGAAGTGGACTGTTTCAAAGTGGGAAAGACTTCTGTGTTCAGACAAATCCAAGTTTGACATTCCTGTTGGTAATCACGTACGCCACATTCTCCGAGCTAAAGAGGAGGGAGACCTAGAGTTCAGTTCAAAAGCCAGCATCTCTGATGGTATGGAGGTGTGTAAGTGCATAC
AGTATGGACAGCTTACATGTTTTGGATGGAACTATGAATGCTTAAAGTTCTATAAAGGTTTTAGAGCAACATATGCTCCCCTCCAGATGACTCAGGGAAGGTCTTGTTTGTATTTGAGCAGGACAACACAAAATCATATACTGAACCTATAGCAACAACATGACTTTGTCCGAGAAGAGTAGAACGAGCAGCTGTTTCCATTATCTAAGGGCTTCTTCACTGTGTTTCAAAAGAAGCTTCATGAAGGGGGGAAGGGAGCAAAGTTATGCTTCACAGCATCAACCTCACTGTGAGGAAAGAGAGTTGAAAGGCCTCCCACAATGACTGTATCAGATGCATCTCATTTCAAAGCTATGATGGTCTGATCCAGTTAAAGAGCCAGGCTCTGCTTACAAGCTGCACTTATCTTCATCAAAGACAACCTCAAACTATTCAGCTGCTAGAAACCTTTATCAAAGCAACACTAAAACTCCAGAAAGTCATAAGTTTGATGCTCAGACACCTTCAAACTGATTTGAAAAGAAGAGGACAGGCTACACCCTGGTAAACTGGCCCCTGTCCCAACTCTTCTGAGACCTGTAGCAAATTTGAAATGAGCTCATTTAGTGGATGAAATCGTAATTTTTTTCAGTTTAAACATGTGTTATGTTATTCATGTTAAATAAAATAAATAAAACATTGGTTAATTTTGTTCAAATTTAAAGAATGGCCCAACTATTGTAGAATTCAGGTTGTAGTTTTGTTTGGTAAAAATGTTTTAACTGCCTGATGAGGCACTGATGGGAGTGTTTACTGACTTATTACAACTGTGAGTCAGACTGAGCCTGTGTAGATGCCTCTGTGTCAGGAAACAACTTTAACCTGCTTTGGGGTCAAGCAATCAGTTACTGAGAAATTTGCTTAGTACTGCGGAAATATGAGATTAGAAAGGAAACACAACTCACAGTGTGCACGTGAACACACACGCATTCAGACCACCTTCACTTTACTGTCATTAGTTTTATGTTGTTTGCTGAACCAGCTTTCAATTCGGATTCGGGAGACAGACACTGTCTCTACTTTTAAGATTAGCCTTAAAACTTTCCTTTGTGATAAAGCATATAGTTAGGACTGGATCAGGTGACCCTGAAAGTGCAGGCTGCTGGGAGATTCCCATGATGCATTGAGAGTTTTACTCTTCAGTCATGTTTTTCACTCAATGTTCTTACACACAACTCTGCATTTAATTATTCGTTATTATTAATACATGGCTGTCTTAACCACTGTGGTTTTGTCCTGTCTTAGTCCCCTCATCCCAACCCAGTTACAGTTTTTCTCACTTTCTCCAAACCTACACATTAAAAATGCTGTTTTTTTCATAACTCAACTCCTGGGTAAAAGCCGTAGGTTCGTAAGTTGGGTTAGTTTGACCAAATATTGGTTCAAAAGTTGTGACTCAGGACTTCTTGATGCATGCTCAATCATCCAGGTAAGTAAATCTCACTCAATCTCAGCTGACTGCAGGTTTCCCCAACCTTATAAACAGTATTTTTGGCCGCCATGTCTGCAGCCTGCCAGGCACTCAAAGAAAGCTGGACTTCATTGAGATACCAGCTATTTGCTCTACGGAGGAGTCATATTAACAACATTCTCACTAGATTTTGTTCTTTTTACTTTTAATCTATTGTCTTGGCTATTTTACTAAGTCTCTGAAACGAAAATGTATACAAATGGCCCACTTATTATAAATTCCCTGGCAAATAAACGAAAGGAGAAATGGTTTGTGTATCAGGTACTATTTAGACTCAATCCTGTTGGCATCATTGTGTCCTCTATTTTTAATTAAATGCATATTTAGCTCAGGTTAATATCGATCTTCTAATATCTTGATCAGATTCCACGCACACTGAGTAAGTCATTATGCACATAAGACACGGCATCCACAGGAGTGCACAATAAAAACAAACACAGTCGGATAAAGTTTACTTGTTGACTCTGAGTCGTGTTGGTCAGAATTCGCT
TATAATAATTTAAGACATTATCTCTTTTGGGTTTTCTTTTATATTTAGAAAAGAAAATAATTTAGTCTTTCGTTTAGTTGCTTTTCATTTTATTTATTTTCATAACTGTAATTAAAATCAAAGAAAATAGTTCTGCGCAGTCGGTGTGGGGATCTGACGTTTCAATATCAATCCGCAGACTGGTATAAAACCGCAGGAGCGCCAGAAACAGACTGTTGCTGTTCCGTCTGTGTTCTGTGTGTATAACTGAAGGTTGCACTAGTTTGGTCTTGAATTTTGAGTGCGTAGTCTTTTTTTCATAAAACCCGAAGCTCGTCTACACGATCAAATCAGGAGACATTGCCCCTAAACTATCGGACTGATTCGGAGAGTGCCTCTGTTCCACTAAATTAAGACTAAAAGGTCAGCCTTCTGTGTACGCACAGAACATGGATGACATCATCCGCACAGTCTTTACGGCTGTGCTATGGACCCTTGCCTCATCCCACGAAGGTAAGTAGAAACCACATGTTTAAGTTAGAAGGCTTTTGTTTTCTGTGTAAAAGTTTCTGTAAACGGCCTGCTGGTTCTTGGATTTGCAGAGTCTCTATTCACTATTTTTGTAACTATTTTTAGCTCTCAGTAACCAGAAGGCGAGCAGAGACGTGTGAATTATTGTTGGTGTCACGAAACTGCGTGAGCTGCAAGCCTCAGCTTCGAGAAGCTTTTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATACTAGATATTGTGGTAAGAGAAACCCACACCATATTTCAGAATAAAAGCCCTTGGGTCTCGCCTTATTTTATTTTTTTTCCCCGTCTTATTTAAAAATGATATTAAAAGTGAACTTTCTGTTTCCACAAACTAATTTCACCCTACGTGGACAGTACACATTTTCACAGAACTACTATGGCAATATCAAGTCAGTCTGATTTTTATATTTCAAAATAAGAGCCCCTGGAACGTGCTTATGAGCCATTCTTTGAATTCAGAATGTCCTGTTTATACAGACTAATTTCACTTCAAATCAACAGTACACATAACAGTGCTCCTGGATTAATGTCAGTCTGGTTTTGGATTTCAAAGTGAGAGCCATGTGAAATTGCCTGTATACGTATATTATATCTTTATTCTCGCTCCAAATGCCATTAAAAAGAAACTTCCTGTTATTTATTTTGCAAGTTTATATATAAAGCTTCAATCTAAGTGAATGTTGAGTGAGACAACTTTAATTTTTTTAAAAAGATGCCGAAATATGCTCCAACCAAGAATAAGCAAAGAAAAACAAGAGCACTGTCCTGGACCGCCACTGTGCCTCAGAGCAGCAACACCTTTCACTTCTATGTTTTTGTCCTCCATTTGGTTGGCTCCTTTTTTTTTTTTTTTTTCTTTGTTACTGGGAAAATTTCCATTTTTGCTTTTACTGAATTCTCGCAATGTTTTCAGCTGATACCTCTTTGAGAGCGGCGTCTCTCTCTGATATCAAATTGTGTGCCAAACGTCACTGTCGGTGTCAACTCTGCCTCCTGTGCTTCAGCGGAGCTTGGAAGCCTTTCTATGCTAAAACATTTTTAACCTCTGCTTTGTTTTCATTCTTTCAGACCACAAAAACATCACAATTGAATCTGGACAGAACCTCACTCTGCCATGTCGAGTTTCAAACATGTCCTTCATAGCCATAGAGTGGAGCAGATTTGACTTAAAGCCAGAATATGTACTTTTAAGCCGTGACGGGCACTTTGACCCACACAACCAGCATTTATCTTTTATGAATCGAGTTGATCTGCAGGAAAGAAAGATGAAGGATGGAGACGTGTCTTTGATTCTGAAGGACGTGACGACTGATGACACTGGAACATACGAGTGTCGTGTCTTCATGGAAGAAACACGCTCATGGAAATCCATCAGCATCATCTACCTGAGAGCTGTTGTTCCTCCAG
5.1 years ago
ATpoint 85k

Please start here and then use the search function: Digesting Fasta Sequences Into A Set Of Smaller Sequences
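
For reference, the splitting described in the question (2500 bp pieces, with the remainder in a final, shorter piece) can be sketched in plain Python. This is only a minimal sketch and is not taken from the linked thread. The function names, the `_part<n>` header suffix, and the output file naming are all hypothetical choices made for illustration. It also assumes each `>` header sits on its own line, with the sequence on the lines that follow.

```python
from typing import Iterable, Iterator, List, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header = None
    parts: List[str] = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(parts)
            header, parts = line[1:], []
        else:
            parts.append(line)
    if header is not None:
        yield header, "".join(parts)


def split_sequence(seq: str, size: int = 2500) -> List[str]:
    """Cut seq into consecutive pieces of `size` bases; the last piece holds the remainder."""
    if size <= 0:
        raise ValueError("size must be a positive integer")
    return [seq[i:i + size] for i in range(0, len(seq), size)]


def split_fasta(in_path: str, out_prefix: str, size: int = 2500) -> List[str]:
    """Write every piece of every record to its own FASTA file and return the file names.

    Files are named <out_prefix>_<record number>_<piece number>.fasta (hypothetical scheme),
    so characters in the original header never end up in a file name.
    """
    written = []
    with open(in_path) as handle:
        for rec_no, (header, seq) in enumerate(read_fasta(handle), start=1):
            for part_no, piece in enumerate(split_sequence(seq, size), start=1):
                out_path = f"{out_prefix}_{rec_no}_{part_no}.fasta"
                with open(out_path, "w") as out:
                    out.write(f">{header}_part{part_no}\n{piece}\n")
                written.append(out_path)
    return written
```

Calling `split_fasta("input.fasta", "chunk")` on a file holding one 5934 bp record would, under these assumptions, write three files containing 2500, 2500 and 934 bases.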

It worked, thank you!
