Hi everyone
In RNA-seq analysis, we need to separate samples into two groups for survival analysis. How can I define high level or low level for a gene according to counts or FPKM. Use median? average or quantile?
In TCGA or Oncomine, how are they define the cutoff for a gene ?
Thanks.
Thanks for your advice.
Hi Kevin,
I would like to look at survival of a specific gene between high and low expression tumor samples. For this, I want to divide tumor samples into high and low. I have counts data and transformed them into z-score.
Do you think the below one is right way to divide the samples?
Or Should I follow this high and low samples separation
In that they used FPKM and took the median and based on that they separated samples into high and low.
Which one is the right solution?
Hey bro, There is no right or wrong. Your function looks okay!
Thank you very much Kevin.
Dear kevin, i have used median cutoff via the code below, but for quantile how can i change the code?
it did work with
quantile(clin_df$gene_value)
, but with warnings and i don't think this is the right way, and adding "Mid" is also face with error. thank you kindly.