Ribosomal RNA Depletion in silico

0

Entering edit mode

5 months ago

CL • 0

Hi, currently I have some Nanopore Direct RNA sequences. In the sequencing process rRNA should already be taken care of, although I did not perform the sequencing myself. Could anyone please help me see if the following code can remove rRNA?

# Index for Ribo Genome
minimap2 -d Tair10_Ribo_index.mmi TAIR10_ensembl_rRNA_tRNA.fa -Q -t 3 -k 14
# Map to Ribo, take unmapped reads out (samtools -f 4 does that)
minimap2 -t 10 -ax map-ont Tair10_Ribo_index.mmi nano.fastq | samtools fastq -n -f 4 - > nano_unmapped_ribo.fastq.gz
# Remap with unmapped reads 
minimap2 -t 10 -G2k -x splice -uf -a tair10chr.fasta.fa.mm2.idx nano_unmapped_ribo.fastq.gz > nano_riboDepleted.sam

I have used the code but not a single read has been removed, which is highly unlikely since there should be some rRNA remaining.

Thanks!

minimap rna-seq rRNA nanopore • 760 views

ADD COMMENT • link updated 5 months ago by GenoMax 147k • written 5 months ago by CL • 0

0

Entering edit mode

Are you using publicly available datasets

ADD REPLY • link 5 months ago by naveennavinbeast • 0

0

Entering edit mode

I have some Nanopore Direct RNA sequences

Asking since I have not seen direct RNA nanopore sequences. Does the sequence contain U's in place of T's? If so that may be causing not alignment.

ADD REPLY • link 5 months ago by GenoMax 147k

0

Entering edit mode

Hi, no it is not public data. I used Dorado to do the basecalling, and the resulting fastq file contains "T"s.

ADD REPLY • link 5 months ago by CL • 0

0

Entering edit mode

minimap2 is unlikely to make an alignment error. If this run was from total RNA then it seems surprising that you have no rRNA in your data.

ADD REPLY • link 5 months ago by GenoMax 147k

0

Entering edit mode

Thanks so much! In your opinion, do you think my codes above look correct? (like the logic of mapping to ribosome, taking out the unmapped, and remap to the genome) I would like to make sure of this; if so, I think I probably need to look at my data to figure out where the issues arise.