4.7 years ago
anasofiamoreira94
In my faculty, we use Ion Torrent sequencing. After trimming and FASTQ analysis, we cluster the reads with USEARCH based on read identity and length. However, I would like to know whether there is a newer way to perform the clustering. I'm sorry if this is not a very interesting question. Thanks.
What are you clustering? Please add details.
I assume you are clustering reads to identify sequence duplicates. Take a look at clumpify.sh and its features here: Introducing Clumpify: Create 30% Smaller, Faster Gzipped Fastq Files. And remove duplicates. If some of the reads are expected to be shorter (i.e., contained in others), then you will need to use the containment=t option.
I will see your suggestion @genomex, thanks!
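For anyone wondering what "contained in others" means in the answer above, here is a toy Python sketch of the idea. It is not Clumpify's algorithm and does not call Clumpify. The function name `dedupe_reads` and its `containment` argument are hypothetical and chosen only to mirror the option name. It treats a read as contained when it is an exact substring of a longer read that was kept.

```python
def dedupe_reads(reads, containment=False):
    """Toy duplicate removal on plain sequence strings (illustration only).

    Exact duplicates are always collapsed to one copy. With containment=True,
    a read that is an exact substring of a longer kept read is dropped too.
    """
    kept = []
    # Process longest reads first so shorter reads are checked against longer ones.
    for seq in sorted(set(reads), key=lambda s: (-len(s), s)):
        if containment and any(seq in longer for longer in kept):
            continue  # shorter read is fully contained in a kept read
        kept.append(seq)
    return kept


if __name__ == "__main__":
    reads = ["ACGTACGT", "ACGTACGT", "ACGT", "TTTT"]
    print(dedupe_reads(reads))                    # exact duplicates only
    print(dedupe_reads(reads, containment=True))  # also drops contained reads
```

This pairwise comparison is quadratic and only practical for small examples. For real data, use the tool itself.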