Dear Community,
I performed a surrogate variable analysis on my RNA-seq data using the SVA package in R and want to use the identified surrogate variables as covariates in my DE design formula. There were more than 5 significant SV identified - human samples from different locations and different timepoints, hence high heterogeneity and several confounding factors. The output of sva is anyway limited to the first 5 SV identified and I was unable to successfully increase the number of sv included in the svaseq output file. I am using the code given below:
n.sv
9
svseq=svaseq(edata,mod,mod0,n.sv=n.sv,method="two-step")
I couldn't find any helps/options adapting the output in the vignette/R help/forum, hence the question. Thanks in advance for any help!
Here you go:
edata=data.matrix(x, rownames.force = TRUE)
meta2=meta$condition
mod = model.matrix(~as.factor(meta2), data=meta2)
mod0 = model.matrix(~1, data=meta2)
n.sv = num.sv(edata,mod,method="leek")
n.sv
[1] 9
svseq=svaseq(edata,mod,mod0,n.sv=n.sv,method="two-step")
Do you spot any obvious mistake?
Seems fine, recently they developed batchQC library, here is an example.
tell me what you got