I want to statistically compare two unit vectors. Is there a statistical test like below?
The null hypothesis is that the two unit vectors have the same distribution. In this case, for the below three unit vectors, nv and nu are very similar, so Test(nv, nu) should produce a large p value, but Test(nv, nw) should produce a small p value that may reject the null.
Please note, this is about vector not about a group of numbers, so the components in the vector are fixed, i.e., Kolmogorov-Smirnov test may not be proper here.
Any suggested statistical test?
nv = (0.29521, 0.61899, 0.72374, 0.03809, 0.066660)
nu = (0.28004, 0.59743, 0.74678, 0.03734, 0.07468)
nw = (0.01467, 0.04401, 0.99752, 0.04401, 0.02934)
Thank you!
How about computing the Wilcoxon's RST (Shapiro–Wilk test is significant for nw so it is not normally distributed)? I didn't understand if your data is paired or not. If yes, use the Wilcoxon's signed-RST.
You should give more information. How are these vectors generated ? Do we know anything about the population they come from ? If the vectors are independent samples drawn from multivariate normal distributions, look for Hotelling's two-sample t-squared statistic.
Thanks Jean-Karim. The unit vector is derived from a 5-category frequency vector.
You've normalized the vectors to length 1 but do the original frequencies sum to 1 for each sample, i.e. are you dealing with compositional data ? Or are the categories measured independently ?
No, the original vector is about frequency counts, e.g. (5, 4, 3, 2, 1) and (50, 40, 30, 20, 10), after normalizing, we treat these two as the same, i.e., their distributions are the same. Second question: Yes, categories were measured independently.