What I have seen previously in papers is that the authors find genes that are highly correlated with tumor purity.
I'm mainly confused in Paper #1, when author says:
"Spearman correlations were calculated between their expression and tumor purity to generate a ranked list of genes."
If you have a matrix consisting of 50 samples where they are separated into groups of 10 according to the tumor type (5 groups) and 20,000 rows for genes, how do you perform correlation analysis with 50 tumor purity values for the 50 samples to identify the genes that are highly correlated?
Thank you. Your posts are always helpful.