Hello,
I have one question regarding the approach of differential gene expression analysis from RNAseq data of cancer.
Approach:
1) To find out the Defferentially Expressed Genes between low survived and high survived samples of patients. (example: here in my case, patients survived less than 1 year and patients survived more than 3 years)
I am using GDC data of lung cancer (TCGA-LUAD) and using this above approach i want to find the DEGs. So for this analysis i am using and following TCGAbiolinks package and its tutorials. but even if the TCGAbiolinks analysis is giving the DEG number but the heatmap of those genes is not showing any differentiating pattern. So i did the same analysis using edgeR analysis and limma using the expression set obtained from summarizedExperiment for the DEG, but then this analysis doesn't give any DEGs between less than 1 year and more than 3 years samples of patients.
SO my question is,
1) Is this approach of finding DEGs between less than 1 year and more than 3 years is correct? or is there anything i am missing in the analysis?
Suggestions regarding this approach will be very helpful and it will surly help me to clear my confusions. I look forward to hear valuable suggestions.
Thank you in advance.
Can you please detail further how you used edgeR? and the connection to limma..
Hello roy.granit,
here i provide with the R code i used for the DEG analysis usign edgeR, limma,
I have clinical info from the datatype follow_up and i have made two files containing patient clinical data for days to death more than three years and less than three years. the following code i used to match those barcodes of patients with the sample barcodes.
Following is the RNA seq analysis, this im following according to this tutorial ( http://combine-australia.github.io/RNAseq-R/06-rnaseq-day1.html )
The topTable gives zero DEGs If this process to find DEG is wrong then please let me know that, it would really help me.
Sorry to add this code as a separate comment, but the number of charcters in the post was exceeding than the 5000
Below is the code where i have followed TCGAbiolinks tutorial,
these two methods i used to find the DEG, if there is anything wrong with one or the method please let me know as this would help me clear my doubt. And please tell the approach I'm using to find the DEG is correct or not correct.
Thank you so much for the help.
in this code :
can anyone help me ?
That error says you dont have "
luad.exp
" object in your environment, so R complaining and says it can not save an object which does not exist.Also since you specified
"save.filename = "TCGA_LUAD_Exp.rda"
in your commandluad.exp <- GDCprepare(query = query.exp, save = TRUE, save.filename = "TCGA_LUAD_Exp.rda",summarizedExperiment = TRUE)
, there is no point to dosave(luad.exp,file = "luad_exp.RData")
at least at this step.