Hi:
Given TCGA RNA-Seq data (level 3), how one can do single sample GSEA?
For example, TCGA RNA-Seq data has
Gene Raw counts RPKM
ALK 434 2.3
.. .. ..
Can we create a GCT file of Raw counts or RPKM to ssGSEA in gene pattern?
Thanks
Adrian
Hi,
ssGSEA in comparison with GSEA calculates separate enrichment scores for each pairing of a sample and gene set.
However, both need the .GCT files as an input. From the TCGA you need to download the level 3 data, however it has to be expression. So, after some preprocessing your .GCT should look like this:
I think that there is no possibility to calculate those statistics from raw counts and rpkm. Raw counts may be biased, so I think that you should consider some normalisation step for such analysis to make it comparable with other studies.
Best regards!
Could anyone explain how ssGSEA process the gene expression data to rank the gene for each patient sample? Dose it need to be compared with the normal sample?