Hi.
I am basically a biologist and very new to RNASeq/bioinformatics. We just got our RNASeq results. We had two groups. Cells infected with virus and uninfected control group (n=3 for each group, biological replicates). Our infected vs control are not separating in 2 distinct groups when PCA is performed. There are no DEGs when adjusted P values are considered. P values are significant and there is log2 fold change for some genes. Using the same gene list, I ran GSEA (using galaxy server, standard parameters) and found some enriched hallmark pathways with significant adjusted p value.
I am totally confused if this all is making sense at all? Should i consider my GSEA results reliable?
Again, I am very new and might need a very basic explanation to understand what is happening here.
Thanks.
Lalani
Hi.
Thanks a lot for the response. So to answer your questions, this is what galaxy says:
Currently the egsea.cnt function is implemented in this tool. This function takes a raw RNA-Seq count matrix and uses limma-voom with TMM normalization to convert the RNA-seq counts into expression values for EGSEA analysis.
Yes, the results do make sense as in some viral infection we do not expect transcriptomic storm. Rather, it balances the infectious states by adjusting the expression of group of genes to stabilize/combat the virus attack. I plan to pick one of the pathway which is of interest to me and verify the gene expression by qpcr.
So in this case, my only worry is, would it be questioned if i will use gene list which is not differentially expressed (non-significant) for validation by qpcr but has impact on pathway?
Thanks
This really depends on how you frame it. Finding zero DE genes is pretty rare, particularly for an experiment like this. Even just a very low concentration of DMSO will have an impact on the transcriptome in vitro. However, only 3 replicates for each condition may not be sufficient to pick up more subtle shifts in gene expression with low effect size.
I'd consider picking one of the GSEA genesets in your results that you trust the most (or make the most sense biologically), validating a few members thereof via qPCR, and then thinking about if there's any way you can make that response more robust - increasing viral titer, altering the collection timepoint, etc. You could consider doing another RNA-seq experiment with improved conditions (validated by your qPCR genes again prior to seq) if there's funding for it.
Thanks a lot. That really makes sense and it was helpful.
Appreciate your input and help.
Regards, Lalani