I am working on an RNA-seq transcriptomic dataset. I have two ways to identify the important genes-
- Differential expression using DESEQ2
- Comparing absolute expression levels of tumor to normal samples
In the second case, I am aiming to determine the genes that are expressed in tumour samples over the maximum threshold of expression in normal samples to give me genes that lie outside the range of normal in more than 80% samples.
Both approaches give me different results. What would be better to implement? Let's discuss.
Remember that RNA-seq data are compositional, and you can't treat the count data as absolute expression values.