Dear Biostars, Hi
I have used Trinity and its default scripts for my de novo transcriptome assembly (3 biological replication for condition 1 and 3 for condition 2), count.matrix creation (using RSEM) and edgeR (FDR=0.001) for DEG analysis.
Recently I have heard that using GLM method is suggested as an insurance of accurate differentially expressed gene data, but I do not know how to use it and what program or R script I should use and which Trinity intermediate files I should target?
I have read the description of such analysis in a paper as :
"We used univariate generalized linear models (GLM) to identify differentially expressed genes in response to each condition challenge for each species separately. Negative binomial GLMs were implemented using the “edgeR” v3.8.6 package in R v3.1.3.....For this analysis, we considered only Log 2 fold changes from the genes that were identified as being significantly differentially expressed individually by each species in the GLMs above. Nonparametric Wilcoxon rank-sum tests were again used to compare the rank order of fold change between species for upregulated and downregulated genes separately in each treatment. "
Q: How can I use GLM for my RNA-seq analysis?
NOTE: I have heard that edgR does not use GLM by default, but DESeq2 does. I ran the same analysis using DESeq2 and all edgR results was contained in DESeq2, too. is that enough?