Entering edit mode
8.3 years ago
bioinforesearchquestions
▴
370
Dear All,
I am working on the RNAseq data. I would like to know what is the significance threshold for differential gene expression (10folds, 50folds, 100folds) between tumor and normal sample?
This is precisely the point, one should not figuratively filter on log2(FC) values first. First value of filtering is your FDR corrected pvalues and this conserves the fact os 1% or 5% error rate. After that if you still see too many genes then you can filter out low expressed genes that have very might fold change unless they are not having miRNA as sometimes small changes might also induce some phenotypic changes or induce some pathways that might infer the phenotype. Usually in cancers the cut off is first at adj.pvalue of either 0.01 or 0.05 depending upon the number of DEGs and then still if you have a lot of differentially expressed genes then filter out genes with low fold changes as they might not give large biological effects which can range from 1-1.5 log2(FC) value for cleansing post adj.pvalue filtration to give much more robust gene sets.
Thanks, WouterDecoster and vchris_ngs.
I am pretty much new to this RNAseq. I have the following cuffdiff output files, - gene_exp.diff - genes.fpkm_tracking - genes.count_tracking - genes.read_group_tracking
similarly for isoforms, cds and tss group.
Can you please shed some light on carrying out these steps? Do you have any research articles discussing it in detail?
Perhaps it's worth trying this very recently published workflow: http://www.ncbi.nlm.nih.gov/pubmed/27560171
Thanks, I will go through it.