Hey All I have data frame with 5 Samples A,B,C,D,E.
A is parent (reference)sample and rest of samples are from patients. each row represents a miRNA and value against that row in each column represents Back ground subtraction values of that miRNA in each sample. I want to perform ANOVA test in R. I am bit confused how I should perform either with Parent and one patient sample (A&B) A&C and so on). Secondly most of the ANOVA tests which I saw on google and youtube they have for example one column with data second column with different groups for the value for example
Weight Loss Diet
1.2 A
22.3 A
5.4 C
33.5 B etc
A B C D
hsa-miR-199a-3p, hsa-miR-199b-3p NA 13.13892 5.533703 25.67405
hsa-miR-365a-3p, hsa-miR-365b-3p 15.70536 52.86558 18.467540 223.51424
hsa-miR-3689a-5p, hsa-miR-3689b-5p NA 21.41597 5.964772 NA
hsa-miR-3689b-3p, hsa-miR-3689c 9.58696 44.56490 10.102051 13.26785
hsa-miR-4520a-5p, hsa-miR-4520b-5p 18.06865 28.06991 NA NA
hsa-miR-516b-3p, hsa-miR-516a-3p NA 10.77471 8.039662 NA
E
hsa-miR-199a-3p, hsa-miR-199b-3p NA
hsa-miR-365a-3p, hsa-miR-365b-3p 31.93503
hsa-miR-3689a-5p, hsa-miR-3689b-5p 24.26073
hsa-miR-3689b-3p, hsa-miR-3689c NA
hsa-miR-4520a-5p, hsa-miR-4520b-5p NA
hsa-miR-516b-3p, hsa-miR-516a-3p NA
How I should do for my data
Thanks in Advance
Best
Adnan
Dear Mikhail thanks for your answer.
I don't have any replicate in my data all samples have same miRNAs but the value of expression changed from sample to sample.so If I will make groups then I have 2019 miRNAs in each sample for A group I will have 2019 miRNAs and for B same and so on??
Yep that would be
and so on
Dear Mikhail after melting my data frame thats how my data look like.
and then what I did I compared groups with values by doing
First Question: Do I need to compare values with miRNAs?? after ANOVA test I performed TukeyHSD and results are below.
If I will interpret these results it shows that we have strong presumption against NULL Hypothesis for the groups
D-A
andD-C
. So we can reject our NULL hypothesis and we have ground realities on believing that there is a relationship between these two groups??Best
Adnan
Hello!
Everything seems correct. The anova will show that expression values of your miRNAs of interest indeed significantly vary by patient group in case you get P<0.05. If that is true, you can interpret the results as that you've found a trend between miRNA expression and patient group and post-hoc analysis with multiple testing correction has shown that groups D and A and D and C are significantly different in miRNA expression.
Comparing values with miRNAs is quite odd in your case, basically this will tell if some miRNAs have typically high expression in patients, while some have typically low. Expression of individual miRNAs varies a lot, so I would expect that you're almost certain will see some statistical significance here.
PS I've just realized that a set of 2019 miRNAs could be the whole human mirnome, not just a specific set of miRNAs of interest selected based on prior biological knowledge. Why haven't you used more conventional methods like cluster analysis in your case?
Basically if I'm correct you've found that miRNAs as whole are up/down-regulated, which is weird (unless you're studying some miRNA transcription machinery). You should rather find differentially expressed microRNAs using a package like DESeq, which would also provide means for post-hoc analysis.
Ok now I've confused everyone and myself a little :)
First, group A (parent) should be removed from analysis, as it is inappropriate to use same data for normalization/clustering and follow-up ANOVA http://stats.stackexchange.com/questions/116294/appropriateness-of-anova-after-k-means-cluster-analysis
Second, DESeq is for read counts and the OP is talking about microarray data. Use Limma instead