Hi all,
I just got PacBio reads for several human individuals; however, some of them seems to conform to what their name says (e.g. HiFi), whereas others seems to be CLR. After asking in my lab, my colleagues told me if that is the case I will need to manually filter HiFi sequences out of those CLR reads.
Someone came acorss this issue before? I was wondering whether there is an simple/easy way to do it. Sorry for asking but I'm quite new to the field. Thanks in advance.
Thanks a lot for the clarification! That's exactly the case; in fact, I both have access to the
reads.bam
and to thefastq.gz
.However, without being aware of this procedure I downloaded the fastq.gz file... is there a workaround even with this format, or I will need to download the bam file?
I'm asking this because the next step will be running hifiasm with long range Hi-C data to fully-phase these sequences and I'm not sure on how (and if) the tool accepts bam files.
Thanks again,
Matteo
It's really hard to say without looking at your files.
If you reach out to support@pacb.com, they can try to help answer that question.