Obtaining rRNA and tRNA contaminant file for RiboSeq
1
0
Entering edit mode
5 months ago
Daniel ▴ 30

Hello,

I am trying to obtain sequence files to align my ribo-seq fastqs to (so that I can filter out unwanted tRNA and rRNA). I believe the industry standard is using SILVA's db, but to a new user there is 0 helpful info on the site explaining which files to download or where they even are... Could someone please point me in the right direction?

Thank you.

ribo-seq SILVA rRNA • 801 views
ADD COMMENT
0
Entering edit mode

What organism do you need this information for?

ADD REPLY
0
Entering edit mode

I need it for human Hg38!

ADD REPLY
0
Entering edit mode

You can find the rDNA repeat information in how can i download human ribosomal reference ?

You should be able to use BioMart to get the tRNA sequences as well.

ADD REPLY
0
Entering edit mode

Thanks for sending this but it actually looks like ensembl does not annotate tRNA genes https://support.bioconductor.org/p/66192/. I couldn't find it through the filters either.

I also feel I need a dedicated tRNA file (analogous to the ribosomal repeat file linked by GenoMax in the answer you linked). The reason is when I aligned my ribo-seq reads to ensembl annotated rRNA genes, barely anything aligned. When I used the ribosomal repeat .fa file, I had significant alignment. I feel it must be analogous to tRNA.

ADD REPLY
1
Entering edit mode

tRNA population in RNAseq is likely going to be much smaller than rRNA which can be upwards of 95% of cellular RNA.

If you must get a tRNA file then you can download the sequences from https://rnacentral.org/search?q=homo%20sapiens%20tRNA and create a non-redundant set.

ADD REPLY
2
Entering edit mode
5 months ago
Jack Tierney ▴ 410

I suggest using RiboDetector. This doesn't really answer your question but it does achieve the same goal (for rRNA at least which is typically the highest contaminant).

To actually answer your question you can get rRNA from SILVA. Use the Browser tab to select Homo sapiens then add all human sequences to cart before downloading. You just need to work out which category humans are in. Eg. EMBL-EBI/ENA>Eukaryota>Metazoa>Chordata>Craniata>Vertebrata>Euteleostomi>Mammalia>Eutheria>Euarchontoglires>Primates>Haplorrhini>Catarrhini>Hominidae>Homo

You can download tRNA sequences from gtrnadb.ucsc.edu specifically here.

ADD COMMENT
0
Entering edit mode

Thank you so much! RiboDetector worked perfectly, much appreciated :)

ADD REPLY

Login before adding your answer.

Traffic: 1409 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6