Hi there,
I wanted to perform DE of lncRNAs and mRNAs from the RNA-seq data using DEseq2. I am wondering if it is a good idea to have separate annotation files for each and then perform the differential expression (DE) or is it OK to perform DE analysis using the annotation files that contains both lncRNAs as well as protein-coding mRNAs and then filter based off the gene-biotype?
Your response or insights are highly appreciated. Thanks very much!
Regards, Bhumi
It will be a good idea to process them separately starting from alignment to DE. The following research article suggests a similar approach.
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0150628
My preference is to keep them separate and to conduct the analysis separately. Protein coding mRNAs are typically expressed at higher levels than ncRNAs, with many ncRNAs being expressed at very low values (if at all, in your tissue of interest). For that reason, I feel that normalising them together would in fact bias the results, but I don't have data to back up this hypothesis.
I would be interested to hear other ideas.
Thanks all for your responses. I do agree that performing lncRNA and mRNA analysis separately is a better approach compared to processing them altogether.