miRNAs detected in bulk mRNA-Seq data - how to annotate?
5.4 years ago

Hello,

In our bulk mRNA-Seq data, about 600 of our ~21,000 detected genes were miRNAs. Their expression levels all fall within the range observed for non-miRNA genes, and about 20 miRNAs fall within the upper half of gene expression in the dataset.

I was surprised by this because I thought most miRNAs would be removed via polyA selection. Also, we did NOT use a kit for small RNA capture and sequencing.

I'm wondering whether these reads are aligning to pre-miRNAs that are longer than 75 base pairs. I'd like to take my list of miRNAs in R and join it with a database that records the lengths of the precursor and the mature (processed) miRNAs, to sanity-check this theory.

Does such an miRNA database exist? I'm trying to use mirbase.db, but I'm confused about its use, as well as whether it has the information I'm looking for: http://bioconductor.org/packages/release/data/annotation/html/mirbase.db.html

Thanks!

RNA-Seq miRNA R • 2.3k views

This is a great point. Thank you


I suggest you try RNACentral. Filter according to your needs (organism, type of RNA).


I'm a little confused...was your prep designed to catch miRNAs or not?


You wrote "bulk miRNA-Seq" but I don't think you meant that. I think you need to scrutinize the library protocol, or talk to whoever prepped it, because I strongly suspect that this library prep is supposed to filter away small fragments, so you can't treat the things you think are short fragments as legitimate.


Thanks for catching that typo, fixed above. I'm wondering that too, whether pri-miRNAs are even a possibility in an mRNA-Seq prep.

5.4 years ago
Asaf 10k

Go to mirbase.org -> Download -> hairpin.fa to get the sequences of all miRNA hairpin precursors (the pre-processed miRNAs). You can then filter them by organism and map your reads against the resulting database.
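
To check the "precursors longer than 75 bp" idea before mapping anything, the filtering step can be sketched roughly like this in Python. It assumes the hairpin FASTA headers start with a name carrying a species prefix (for example `hsa-` for human). The records below are made-up toy sequences, not real miRBase entries, and the helper names `read_fasta`, `hairpin_lengths` and `longer_than` are hypothetical.

```python
from io import StringIO


def read_fasta(handle):
    """Yield (header, sequence) pairs from a FASTA file-like object."""
    header, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def hairpin_lengths(handle, species_prefix):
    """Map hairpin name -> sequence length for one species prefix."""
    lengths = {}
    for header, seq in read_fasta(handle):
        name = header.split()[0]  # first token of the header is the name
        if name.startswith(species_prefix + "-"):
            lengths[name] = len(seq)
    return lengths


def longer_than(lengths, read_length):
    """Return the names of hairpins longer than the given read length."""
    return sorted(name for name, n in lengths.items() if n > read_length)


# Toy input standing in for hairpin.fa (sequences are invented).
toy = StringIO(
    ">hsa-mir-toy1 MI9999991 Homo sapiens toy stem-loop\n"
    "ACGUACGUAC\n"
    "GGCCUUAA\n"
    ">hsa-mir-toy2 MI9999992 Homo sapiens toy stem-loop\n"
    + "ACGU" * 20 + "\n"
    ">mmu-mir-toy3 MI9999993 Mus musculus toy stem-loop\n"
    "ACGUACGUACGUACGU\n"
)

lengths = hairpin_lengths(toy, "hsa")
long_hairpins = longer_than(lengths, 75)
print(lengths)
print(long_hairpins)
```

With a real download you would pass `open("hairpin.fa")` instead of the toy `StringIO`. The resulting name-to-length table can be written out and joined to your gene list in R, although matching the gene symbols in your count matrix to miRBase names is a separate step this sketch does not cover.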


Watch out for U in the sequences if you get data from miRBase. You will need to change those to T before aligning.
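
A minimal Python sketch of that U-to-T conversion follows. It assumes plain FASTA input and leaves header lines untouched, since names and species descriptions can contain the letter u. The function names are hypothetical.

```python
def rna_to_dna(seq):
    """Replace U with T, preserving case."""
    return seq.replace("U", "T").replace("u", "t")


def convert_fasta_lines(lines):
    """Convert sequence lines to the DNA alphabet; keep '>' header lines as-is."""
    for line in lines:
        if line.startswith(">"):
            yield line
        else:
            yield rna_to_dna(line)


# Toy record (invented sequence, not a real miRBase entry).
toy_lines = [">mmu-mir-toy MI9999993 Mus musculus toy stem-loop", "ACGUUGCAu"]
converted = list(convert_fasta_lines(toy_lines))
print("\n".join(converted))
```

Applying the replacement to the whole file at once would also rewrite headers such as "Mus musculus", which is why the sketch handles header lines separately.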
