rRNA and tRNA removal from mouse Ribosome profiling data - getting correct sequences
1
0
Entering edit mode
5.6 years ago
Beginner ▴ 80

Hi,

I need to filter rRNA and tRNA from a mouse ribosomal profiling and RNA seq datasets. Am I right with the assumption that since for mouse there exists the "complete repeating unit of Mus musculus ribosomal DNA" as found here, (https://www.ncbi.nlm.nih.gov/nuccore/BK000964), I can simply download the FASTA file, make a bowtie index out of it and align it to it? I also found the Silva rRNA database https://www.arb-silva.de/download/arb-files/ . Does this include the same rRNA sequences that is also included in the repeating unit file from NCBI? Should I prefer any of them?

I would construct a bowtie index with:

$ bowtie-build species_rRNA species_rRNA.fa

and then exclude the mapping reads with:

$ bowtie -p4 -v3 species_rRNA my_reads.fastq \
         --un my_reads_rRNA_unalign.fq >FileLocation

Then I would repeat the same for tRNA. I would get tRNAs from http://gtrnadb.ucsc.edu/genomes/eukaryota/Mmusc10/ under the FASTA link on the left side.

Would this be correct? Would I miss some sequences, especially some rRNA sequences, including rRNA sequences not annotated in the "complete repeating sequence" from NCBI or mitochondrial rRNA?

What about snoRNA or other RNA species. I found that most papers only excluded rRNA and sometimes tRNA but most of the time no other RNA species.

Thank you for your help!

rRNA tRNA Ribosome profiling Ribo-seq • 3.8k views
ADD COMMENT
0
Entering edit mode

You can use bbsplit.sh from BBMap suite to bin these sequences out (A: Filtering rRNA from RNAseq data ).

That said if you leave those sequences in can you exclude them downstream in the counting step?

ADD REPLY
0
Entering edit mode

Sorry for my delayed answer. Thank you for your suggestion and I will try this. Still I am not sure about the differences in "complete repeating unit of Mus musculus ribosomal DNA" and the Silva rRNA database. E.g. does the "complete repeating unit of Mus musculus ribosomal DNA" contain the 5S rRNA?

Also, is the source for tRNA the right one to use? What about snoRNA or other RNA species? Are they commonly excluded?

ADD REPLY
0
Entering edit mode

Did you ever get an answer? I have the same question now. I think the issue with excluding them in the downstream "counting" step is that in ribosome profiling the bam file is used to call p-sites and periodicity, which would be affected by the huge (30%+) portion of rRNA/tRNA.

ADD REPLY
0
Entering edit mode
5.5 years ago
Beginner ▴ 80

Maybe there is any other suggestion, especially regarding the sources (and possible differences) for rRNA, tRNA, soRNA or other RNA species?

ADD COMMENT

Login before adding your answer.

Traffic: 1618 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6