Hello everybody. I'm doing my PhD and I'm a bit new in this field. I have the gene expression of a group of patients before treatment and after treatment. I have performed a differential gene analysis and found 1740 overexpressed genes.
After doing this, I filtered out those genes in the patient samples "Before the treatment", with the idea of seeing which of those genes that are overexpressed have an effect in the survival of the patients.
For this I have this kind of table:
Patient | OS(months) | Death | P2RY8 | BRAF | ...
P1 12 1 7.96891 6.9009
P2 32 0 7.51238 6.39389
P3 22 1 7.51238 7.39389
P4 32 0 6.96891 4.9009
P5 24 1 5.96891 3.9009
P6 33 1 6.96891 6.700
Those numbers correspond to the expression levels after normalization with RMA.
I have seen this post by Kevin Blighe , but I don't want to see the effect that each gene has invidually, but a multivariate analysis.
Survival analysis with gene expression
Do you have any idea how I can do this or if there are any tutorials?
Thank u in advance.
Thank u so much for your reply. I'll try it with my data and let you know ;)
You're welcome! You might also want to consider censoring your data. See the blog below for info.:
Basics of Survival Analysis
Here are some online workflows that might help. The first one uses the caret package in R.
Prediction of Cancer Survival
Random Forest: METABRIC