Integration of two TCGA datasets
3.4 years ago
fifty_fifty ▴ 70

I want to run survival analysis on gene expression using TCGA data, to find genes whose up- or downregulation is associated with survival. This is the validation part of my work, which included both LUAD and LUSC cancers. What is the best way to combine two datasets for two different diseases, e.g. LUAD and LUSC? I got my data from cBioPortal.

Thank you.

r RNA-Seq tcga survival • 1.2k views
3.4 years ago

There is no issue here. You can do either of the following:

  1. perform survival analysis separately for LUAD and LUSC
  2. bind the datasets together and perform survival analysis on the merged dataset

There are greater potential criticisms for approach #2.
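
The two approaches can be sketched roughly as follows. This is a minimal, library-free illustration, not the method from any particular tutorial: it splits patients into high and low expression at the per-cohort median and runs a log-rank test, either on one cohort at a time (approach #1) or across both cohorts with the cohort kept as a stratum (one cautious version of approach #2). The function names, the median cutoff, and the toy numbers are all assumptions made for illustration. In R, the `survival` package would be the more usual tool for the same idea.

```python
import math
from statistics import median


def logrank_parts(times, events, groups):
    """Return (O - E, V) for group 1 in a two-group log-rank test."""
    o_minus_e, var = 0.0, 0.0
    event_times = sorted({t for t, e in zip(times, events) if e == 1})
    for t in event_times:
        # Patients still at risk just before time t
        n = sum(1 for ti in times if ti >= t)
        n1 = sum(1 for ti, g in zip(times, groups) if ti >= t and g == 1)
        # Events observed exactly at time t
        d = sum(1 for ti, e in zip(times, events) if ti == t and e == 1)
        d1 = sum(1 for ti, e, g in zip(times, events, groups)
                 if ti == t and e == 1 and g == 1)
        o_minus_e += d1 - d * n1 / n
        if n > 1:
            var += d * (n1 / n) * (1 - n1 / n) * (n - d) / (n - 1)
    return o_minus_e, var


def high_expression(expr):
    """Label each patient 1 if above the cohort median, else 0 (assumed cutoff)."""
    cutoff = median(expr)
    return [1 if x > cutoff else 0 for x in expr]


def logrank_test(cohorts):
    """Log-rank test; with several cohorts, each cohort is its own stratum."""
    total_oe = total_var = 0.0
    for c in cohorts:
        oe, v = logrank_parts(c["time"], c["event"], high_expression(c["expr"]))
        total_oe += oe
        total_var += v
    stat = total_oe ** 2 / total_var
    # Upper tail of a chi-square distribution with 1 degree of freedom
    return stat, math.erfc(math.sqrt(stat / 2))


# Toy, made-up data for one gene: follow-up time, event flag (1 = death), expression
luad = {"time": [5, 8, 12, 20, 25, 30], "event": [1, 1, 1, 0, 1, 0],
        "expr": [9.1, 8.7, 8.9, 4.2, 5.0, 3.8]}
lusc = {"time": [3, 10, 14, 18, 22, 40], "event": [1, 0, 1, 1, 0, 1],
        "expr": [7.5, 2.1, 6.9, 3.3, 7.2, 2.8]}

print("LUAD only:", logrank_test([luad]))           # approach 1
print("LUSC only:", logrank_test([lusc]))           # approach 1
print("Stratified:", logrank_test([luad, lusc]))    # approach 2, cohort as stratum
```

Keeping the cohort as a stratum means patients are only ever compared with others who have the same disease, which addresses part of the criticism of simply concatenating the two tables.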

Kevin


Thank you! I used your tutorial for survival analysis, and it was very helpful. I performed survival analysis separately for both datasets, and I am wondering whether it is worth combining the data. What would be the biggest potential criticism if I decide to integrate the datasets?


Well, LUAD (adenocarcinoma) and LUSC (squamous cell carcinoma) are different diseases.


I understand that, but both are non-small cell lung cancers (NSCLC), which is what I am working with. Since my initial analysis was done for NSCLC, shouldn't my validation data include both LUAD and LUSC?


Sure, why not just proceed and do it both ways? That is how research is done. Once it has been done both ways, we will know whether the survival association differs between LUAD and LUSC. It likely will differ for the expression of certain genes.
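
One rough way to check whether a gene's survival association differs between the two cohorts is a heterogeneity test on the per-cohort log-rank summaries. The sketch below assumes you already have, for each cohort, the observed-minus-expected count (O - E) and its variance V from a log-rank test, and uses Peto's one-step approximation, log HR ≈ (O - E) / V. The function name and the input numbers are hypothetical. A Cox model with a gene-by-cohort interaction term would be the more usual route.

```python
import math


def two_cohort_heterogeneity(cohort_a, cohort_b):
    """Cochran's Q for two cohorts, each given as (O - E, V) from a log-rank test.

    Returns (log HR in A, log HR in B, Q, p-value); Q has 1 degree of freedom.
    """
    stats = [cohort_a, cohort_b]
    # Peto one-step estimate of the log hazard ratio in each cohort
    log_hrs = [oe / v for oe, v in stats]
    # Pooled (inverse-variance weighted) estimate across both cohorts
    pooled = sum(oe for oe, _ in stats) / sum(v for _, v in stats)
    q = sum(v * (b - pooled) ** 2 for (_, v), b in zip(stats, log_hrs))
    # Upper tail of a chi-square distribution with 1 degree of freedom
    p = math.erfc(math.sqrt(q / 2))
    return log_hrs[0], log_hrs[1], q, p


# Made-up summaries for a single gene: (O - E, V) in LUAD and in LUSC
b_luad, b_lusc, q, p = two_cohort_heterogeneity((2.0, 1.5), (-0.5, 1.2))
print(f"log HR LUAD={b_luad:.2f}, LUSC={b_lusc:.2f}, Q={q:.2f}, p={p:.3f}")
```

A small p-value here would suggest the gene behaves differently in LUAD and LUSC, in which case reporting the cohorts separately is easier to defend than a single merged estimate.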
