Hi all,
I have some RNA-seq samples from multiple glioblastoma tumours that I'm now trying to classify according to a specific gene signature (from Verhaak et al., 2010) using R. The gene signature is reported as a gene list with specific centroids for each of the 4 clusters (https://api.gdc.cancer.gov/data/941f81a1-05d7-4f84-80ec-534b8dc1ebac). I'm wondering how I can use this signature to classify my samples in R? Would it involve some sort of k-nearest neighbours method?
Additionally, the signature was identified using microarray data, but I am classifying RNA-seq data. Is there any sort of adjustment I should make to the signature to account for this?
Thanks in advance!
Hi garrettbullivant, You may already have seen this paper but if not for your last part of the question, this paper might help. They took the RSEM rnaseq and microarray values and standardized it and centered around mean.( I personally have not done this yet although I am still trying to recreate some parts from this paper as an exercise. But when I saw this blog I thought it might help you!) Description of thiscan be found in the BRS section of the supplement. DOI: 10.1158/1078-0432.CCR-18-2953 https://clincancerres.aacrjournals.org/content/25/10/3141 Comprehensive Genetic Characterization of Human Thyroid Cancer Cell Lines: A Validated Panel for Preclinical Studies
IƱigo Landa, Nikita Pozdeyev, Christopher Korch, Laura A. Marlow, Robert C. Smallridge, John A. Copland, Ying C. Henderson, Stephen Y. Lai, Gary L. Clayman, Naoyoshi Onoda, Aik Choon Tan, Maria E.R. Garcia-Rendueles, Jeffrey A. Knauf, Bryan R. Haugen, James A. Fagin and Rebecca E. Schweppe