Dear all,
I am working on an organism that has 10 copies of rRNA genes on its genome. I would like to see the percentage of reads that are aligned to ribosomal RNA.
After alignment, rRNA reads align to every and each of these genes and if I perform the counting for reads that are not uniquely aligned to the genome, each one of these reads are counted 10 times. How can I count each read only once?
Any idea is highly appreciated.
Thanks
Extract reads mapped to the rRNA. Do a
grep "^@SEQUENCER_ID" | sort | unique
to get the unique Fastq ID's. Find the % comparing to total number of reads.Not sure what aligner you used but if you want to align a multi-mapping read only once then you can use
bbmap.sh
optionambig=random
. This will select one top scoring site randomly allowing you to count a read only one time.