Hi Everyone,
In the following examples we see the second example has better result and more separable.
Is it possible to see somehow find this separation based on the data matrix of PCA result? e.g. based on some kind of score like Mean, Median calculation of the components or any other way?
PCA_example 1:
PCA_example 2:
The code used:
pca <- prcomp(dataMatrix, scale=T)
scores <- data.frame(Groups, pca$x[,1:3])
pc1.2 <- qplot(x=PC1, y=PC2, data=scores, colour=factor(Groups)) + theme(legend.position="right")
Thanks in advance
Thanks a lot. It seems a solution to me. I am going to try that. But to fully understand your point:
1- in-class distance: do you mean to calculate pairwise correlation between PC1 and PC2 for a condition(class) ?
2- out-class distance: this one related to first point and didn't get it actually.
Thanks
Think of the eigenvalues of PC1 and PC2 as x and y coordinates defining each dot in the plot above. The dots highlighted with Red and Blue colors in the plot above are your two sample classes. (1) In-class distance: pair-wise Euclidean distance between each dots highlighted in Red (or Blue). (2) Out-class distance: pair-wise Euclidean distance between a dot highlighted in Red and every dot highlighted in Blue. Repeat this for all dots in Red group. You must follow this procedure for each dot in Blue group vs dots in Red group.