Hi, all I'm a newbie, and I'm dealing with Agilent single channel microarray when I filtering DEGs using limma, I find all my results showing logFC>-1.5 or logFC<1.5, without significant expression. I have been searching reason for a long time, but I still don't know what my faults are, I really need some help, thanks! My code as following:
treatment<-targets$Treatment
Level<-c("neurologically healthy control","Parkinson disease")
Group<-factor(treatment,levels = Level)
Design<-model.matrix(~0+Group)
colnames(Design) <- c("Control","PD")
fita<-lmFit(exprs(eSet),Design)
cont.matrix <- makeContrasts(Diff=PD-Control, levels=Design)
fitb<- contrasts.fit(fita, cont.matrix)
fitc<- eBayes(fitb)
##result
Result<-topTable(fitc,adjust="BH",number = Inf,p.value = 0.05)
dt_result<-decideTests(fitc,lfc =1.5,p.value=0.05,method="separate")
summary(dt_result)range
(Result$logFC)
[1] -1.471461 1.192352
One issue is here:
The parameter,
lfc
, expects a value that is already on the log (base 2) scale. So, the value that you are passing as the cut-off is actually:You need to use:
Kevin
Yes Kevin, thank for your answer. But my key question still alive. Once I set parameter lfc=1.5, summary(dt_result) showing Diff Down 0 NotSig 38822 Up 0 That's because of my topTable results with no changes, with nothing significant expression. I really don't know how to get rid of this situation. Could you help me, Kevin? Very thanks!
You are in good hands, now that Devon has added a comment. Please follow up with both him and Fabio (Fabio is another great guy)
Kevin, thank you very much! Devon and Fabio both experienced than me, I will learn modestly from them. Thanks again, wish you have a good day!
Here is my complete code, really hope to get some help.
Does
Result
indicate you have any DE genes?Nothing. I just checked my code again, finding some wrongs. So sad, I always make mistakes. Let me correct it and then discuss with you.
Devon, thanks for your caring! Have a good day!
BTW, my data from this article,https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1002794. I am trying to replicate the Agilent part results (the first part) of this article. As a newbie, It's apparently I failed completely.
Best wish for you again.
If there really are DE genes in that dataset (from the paper I guess one would expect so) then it's likely that there's either an outlier sample that needs excluding or some of the sample labels are swapped. Make a PCA plot and do some clustering and see if that indicates either of those.
Do not worry - we all make mistakes. At least you are honest about it, whereas others would prefer to lie about their mistakes. You are a great person.
How many replicates do you have per condition?
Do you see very high variation between replicates?
Did you try doing the analysis with a toy data set (probably provided by the package developer) to see if using that you got significant results?
Are there additional software packages you may try to see if the result is due to some problem in the analysis with this specific package?
Yes, Fabio. Q1, 27 PD, and 26 control samples;
Q2, my analysis data from this published article,https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1002794. The bad samples have been removed by the authors, I get the cleaned data from ArrayExpress,https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-812/
Q3, not yet.
Q4, I made this analysis using limma at first and got the same results.
I reviewed my practice article again, thinking there may be some faults in my limma lmFit object and model design. I will correct them firstly.
Thanks for your precious time. Fabio, wish you have a great day!
I added code markup to your post for increased readability. You can do this by selecting the text and clicking the 101010 button. When you compose or edit a post that button is in your toolbar, see image below:
In addition, cross-posted: https://support.bioconductor.org/p/112875/
Yes, thanks for your sweet tip, I will notice my code layout next time.