I would like to analyze the following TCGA dataset: TCGA-LUAD. I would like to see if several genes of interest are correlated with a better chance of survival in patients with LUAD.
I managed to retrieve the information needed using TCGAbiolinks. However, after running GDCdownload & GDCprepare, I am a bit lost on how to proceed forward.
There are different approaches that you may follow e.g. voom function from limma package to normalize the data.
Check out this repo, it may help you with your analysis.
With regards to the type of data, you can start with counts coming from STAR, then proceed with normalization as indicated in the abovementioned GitHub repository.
In terms of conditions, my understanding is that you want to do a survival analysis between groups of samples showing "high" and "low" levels of expression for a gene of interest. So this can be a column in your data frame and would be the grouping variable in your analysis.