Hello,
I am doing a small assignment for an intro to computational biology class, and I was told to consult here for help. The topic I was given is using the TCGA-BRCA RNA-Seq dataset to classify the tumor samples into their respective subtypes. I know there are papers (https://ascopubs.org/doi/full/10.1200/JCO.2008.18.1370) that have found gene signatures for each subtype, but i cannot find a good way to determine whether a sample that I have (from the TCGA database) matches those gene signatures or even find suitable numerical values associated with those gene signatures (I only see a heat map showing which genes are up and down regulated for each subtype).
I am a beginner, so bear with me here, and please let me know if there are any relatively simple ways to do this! (Even if they may not be the best statistically-speaking.) Thank you so much!
Are you looking for a PAM50 Signature? You may need to find genes used for PAM50 signature analysis?
Thanks! So, I know about the PAM50 signature, but I can't find a good way to classify genes using the PAM50 signature - do you have any resources that allow you to classify an RNA-Seq sample by these signatures? Perhaps not a tool that does everything for you, but something more generic that allows you to feed in a gene signature and classify samples by that. I also need to find a valid input for such a tool too I guess.
Sorry, but helping students in a class is the professor/TA/RA's job, not ours.
Oh, okay - let me just tell them that!