Normalisation of RNA-seq for different RNA biotypes
1
0
Entering edit mode
8.0 years ago
Nathaniel ▴ 120

I have RNA-seq count data and I want to perform traditional analysis like PCA/heatmaps on each RNA biotype independently.

My question is: when performing the RNA normalisation step (using for instance DESeq) should I do it on the entire gene expression matrix with all biotypes concatenated, or is better if I do it separately for each gene biotype?

What worries me is that if we do the processing in the concatenated expresion matrix, the mRNA will completely mask the miRNAs, since they are much more expressed. Also, if i then filter let's say the 10% less expressed genes I will probably end up filtering more miRNAs than mRNAs for the same reason.

Thanks.

rna-seq deseq • 1.9k views
ADD COMMENT
1
Entering edit mode
8.0 years ago

Estimate the size factors and dispersion using the whole dataset and then subset for further comparisons. Your miRNAs won't be masked then or filtered out. Having said that, if you did RNAseq then you're going to get largely meaningless miRNA counts. I would strongly encourage you to simply ignore them. If you want to measure miRNA differences, sequence small RNAs. Otherwise you're just measuring differences in size selection during library prep between the groups...

ADD COMMENT
0
Entering edit mode

Great, thank you! does the same apply for long RNAs such as lincRNAs? Those should be safe to analyse, right?

ADD REPLY
1
Entering edit mode

lincRNAs are fine if you did ribo depletion rather than polyA selection.

ADD REPLY

Login before adding your answer.

Traffic: 2087 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6