Hi Everyone,
I built a logistic regression classifier for predicting genes, and it has around 75% accuracy. Predictor A, a transcription factor, is one of the top predictors. However, when I look at the scatterplot, there is no visible correlation, and the positive and negative examples show the same distribution. Is this still reasonable?
Thank you everyone.
Hi Jean-Karim,
Thank you for your help. I would just like to clarify further: the logistic regression takes into account the individual contribution of each feature, but I did not add any interaction terms. In that case, I'm not sure if I can say that there is some interaction among the features.
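
To make the question concrete, here is a minimal simulated sketch of one way this pattern can arise with main effects only, with no interaction terms. Everything in it is an assumption for illustration, not my actual data or model: the names `signal`, `noise`, `a`, `b`, the sample size, the slope of 3, and the hand-written gradient-descent fit are all hypothetical. Predictor `a` is generated independently of the label, so its distribution is the same in both classes, but it is correlated with a second predictor `b`. A plain additive logistic regression can then give `a` a sizable coefficient because subtracting it cleans up `b` (sometimes called a suppressor effect), which is different from an interaction.

```python
import math
import random


def simulate(n=2000, seed=0):
    """Toy data: the label depends only on `signal`; A carries none of it."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        signal = rng.gauss(0.0, 1.0)
        noise = rng.gauss(0.0, 1.0)
        a = noise            # predictor A: independent of the label
        b = signal + noise   # predictor B: signal contaminated by A
        p = 1.0 / (1.0 + math.exp(-3.0 * signal))
        X.append([a, b])
        y.append(1 if rng.random() < p else 0)
    return X, y


def fit_logistic(X, y, lr=0.5, steps=200):
    """Gradient descent on the mean log-loss; additive terms only."""
    k, n = len(X[0]), len(X)
    w, bias = [0.0] * k, 0.0
    for _ in range(steps):
        grad_w, grad_bias = [0.0] * k, 0.0
        for row, label in zip(X, y):
            z = bias + sum(wj * xj for wj, xj in zip(w, row))
            err = 1.0 / (1.0 + math.exp(-z)) - label
            for j in range(k):
                grad_w[j] += err * row[j]
            grad_bias += err
        w = [wj - lr * gj / n for wj, gj in zip(w, grad_w)]
        bias -= lr * grad_bias / n
    return w, bias


def accuracy(X, y, w, bias):
    """Fraction of rows where the sign of the linear score matches the label."""
    hits = 0
    for row, label in zip(X, y):
        z = bias + sum(wj * xj for wj, xj in zip(w, row))
        hits += int((z > 0) == (label == 1))
    return hits / len(y)


X, y = simulate()

# Marginal view of A: difference in class means (what the scatterplot shows).
a_pos = [row[0] for row, label in zip(X, y) if label == 1]
a_neg = [row[0] for row, label in zip(X, y) if label == 0]
mean_gap = sum(a_pos) / len(a_pos) - sum(a_neg) / len(a_neg)

# Model with A and B as main effects.
(coef_a, coef_b), bias_both = fit_logistic(X, y)
acc_both = accuracy(X, y, [coef_a, coef_b], bias_both)

# Model with B alone, for comparison.
X_b = [[row[1]] for row in X]
w_b_only, bias_b = fit_logistic(X_b, y)
acc_b_only = accuracy(X_b, y, w_b_only, bias_b)

print("class-mean gap for A:", round(mean_gap, 3))
print("coefficients (A, B):", round(coef_a, 3), round(coef_b, 3))
print("accuracy with A and B:", round(acc_both, 3))
print("accuracy with B only:", round(acc_b_only, 3))
```

In this toy setup, A looks uninformative on its own yet gets a large coefficient and improves accuracy when combined with B. Whether something similar is happening in the real classifier would depend on how Predictor A correlates with the other features, which the scatterplot of A alone cannot show.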