Dear all,
Maybe its very basic question for you all.
I would like to perform 2 way ANOVA test for my NGS time series counts data. It is raw data counts by HT-Seq. I have 2 condition and for each corresponding diff. time points and their counts. Suggestion is to perform 2 way anova test. but problem is, I don't know the better way to prepare file for that in order to use R function like aov()
. Help me to do so....
My raw counts (without normalization) like..
gene_name counts_value
(for each condition and corresponding time points (36 sample in total) )
gene_1 2349
gene_2 3462380
gene_3 0
gene_4 123
Query:
- How to prepare (format of input) input file in order to use R functions
- Is there any package (bioconductor or R) to perform this analysis
- May I have to normalize this raw counts before performing 2 way ANOVA
Thank you sir for your kind advice but before analyzing differential expression gene analysis, I just wanted to use 2 way ANOVA test in order to investigate and model the relationship between a response variable and predictor variables. Just short of preliminary analysis.
and About DESeq2, indeed i am used to with this amazing package, not for time series analysis but for another standardized experiments (2 conditions and their replicates).
You still don't want an ANOVA then. ANOVA's require continuous variables, which you don't have (you have integer counts). You could
read.delim()
the count files in,cbind()
those to create a matrix (make sure to remove the last 5 lines) and then useglm.nb()
fromMASS
. You could instead use a glmm if you really wanted. Either of those would be better ideas than an ANOVA.