Hi all,
I want to remove duplicate reads from my fasta file. I tried to use fastx_collapser. But since my reads contains lowercase letters and hyphens it failed.
Please help.
Thanks,D.
Hi all,
I want to remove duplicate reads from my fasta file. I tried to use fastx_collapser. But since my reads contains lowercase letters and hyphens it failed.
Please help.
Thanks,D.
Try the sequniq tool from the GenomeTools suite:
gt sequniq -o output.fasta input.fasta
Here is my free program on Github Sequence database curator (https://github.com/Eslam-Samir-Ragab/Sequence-database-curator)
It is a very fast program and it can deal with:
It can work under Operating systems:
It also works for:
Best Regards
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
duplicate of:
How to remove the same sequences in the FASTA files?
It's like everybody wants to remove duplicates here!