I have a microbiome data set with 960 samples from 269 subjects. 193 subjects have a single sample and 76 subjects have repeated measurements at different time points. The dataset also consists of 70 taxonomic features. The target variable has 4 classes, I want to build a multiclass classification model to classify them.
What steps should I follow to build a model with such a microbiome dataset and what should I consider? More specifically, to what extent should I account for within-subject correlation due to repeated measurements in the model?
When I evaluated each observation independently, I found that the accuracy of the random forest performance was 0.86 on training data and 0.90 on unseen data. Are these results biased?