Hi I was retrieving differentially expressed genes through R and got an error at the very last step. Please help me out in removing following error it will be very helpful.
Error
results= topTable(fit2, adjust ='BH', number = nrow(gse32269eset))
Error in (function (..., row.names = NULL, check.rows = FALSE, check.names = TRUE, :
arguments imply differing number of rows: 54675, 22283
ADD COMMENT
• link
updated 8.5 years ago by
ATRX
★
1.1k
•
written 8.5 years ago by
786
▴
50
0
Entering edit mode
I think It's related to selecting the number of DEGs,why you select all the prob-id in your GSE? try with a number,lets say 100, I suggest you specify log fold change for your topranked gene e.g logFC=0.05
try using: topTable(fit2, adjust ='BH', number = Inf)
I think you are filtering some probes or aggregating the probes due to which there is a difference in the number of rows between fit2 and nrow(gse32269eset)
Sorry I have ovary cancer samples stage 1, 2, 3 and normal samples
I was going to have something like your above code resulted in P.value , adj.P.Val , logFC
I did like below
library(limma)
library(affy)
Data<-ReadAffy()
eset<-rma(Data)
exprs<-exprs(eset)
dim(exprs)
[1] 54675 4
design <- model.matrix(~ type + Tmprss2.erg + 0, as.data.frame(group))
Error in as.data.frame(group) : object 'group' not found
Actually I was going to compare normal with rest of cancers in 3 stages but I got error
Com'on.. you need to define the group first.. i.e. which sample belongs to which group. Read about limma first and try to understand the code. Please think and apply. I am sure you don't want yourself to be spoon-fed.
I think It's related to selecting the number of DEGs,why you select all the prob-id in your GSE? try with a number,lets say 100, I suggest you specify log fold change for your topranked gene e.g logFC=0.05