Hello,
I have some CEL files and a CDF file for them, now how I can remove Control probesets (probesets with name started by AFFX) please? I mean how I can define these probesets to enter them in text file to be excluded without need to R?
Hello,
I have some CEL files and a CDF file for them, now how I can remove Control probesets (probesets with name started by AFFX) please? I mean how I can define these probesets to enter them in text file to be excluded without need to R?
Hi Sarah, CEL files are binary files, so they will not be readable in a text editor -- you will have to use R or Matlab. Is there any reason in particular you do not want to use R?
Once they are loaded into R, it shouldn't be too difficult to remove them; given all of them have the standardised format of starting with "AFFX", you can use grep:
# load the affy library
library(affy)
# read CEL files in your working directory and normalise them
data <- ReadAffy()
exp <- rma(data)
# remove all probes starting with "AFFX"
control <- grep("AFFX",rownames(exp))
exp <- exp[-control,]
I know the feeling, I only started learning R recently too. It really is an invaluable tool to know your way around, and I'd recommend spending some time familiarising yourself with data types and methods if possible. I don't have any one resource that I use, I tend to go to either the manual pages or just simply Google things. I cannot stress how much having an underlying understanding of the language helps when amalgamating bits and pieces of code from the web though.
There is an online course that's started recently that might be of interest to you if you are interested in learning R: https://www.coursera.org/course/rprog
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Hi Alexander and thanks a lot,
I am not familiar enough with R and I don't know the nature of codes and just copying and pasting them from tutorials...
I entered your codes like below:
Now where is the resulted normalized text file please to use it in the later purpose please? I need a normalized text file
If the control probsets have removed now that I use these Cel files as a input in RMAExpress for more normalizing?
You won't need to normalise the data again. You can get the normalised text file with
Thank you very much