Hello dear friends.
I'm kind of stuck on some analyses.
I have expression values of a gene in 97 samples. about half of these samples are healthy, and the others are patients.
Now, I want to perform a univariate logistic regression and predict disease occurrence based on the gene expression level. I used the glm()
function as below, but I got a very very huge odds ratio and CI. My data doesn't have any NA value, also I don't think there is any outlier, because the values are in the same range. (all of them are between 8 - 10, maybe a little up or down) . Also, the p-value is highly represents meaningful. I don't know what's the problem and I have searched for that for hours, but I could not fix that. I'd appreciate it if you share your answers.
### FRGmetadata is a data frame in which its columns refer to different variables, and the rows are sample GSM numbers.
model <- glm(formula = status~TXN, data = FRGmetadata, family = binomial)
Utmost sincerity