Is there any python package that can be used for normalizing data from Affmetrix expression microarrays?
I understand the Affy library of Bioconductor package in R can be used. However, I would like to perform normalization using libraries in python.
I found pyaffy available on github. But there are problems with installation on windows.
Any suggestion on other libraries in python?
Many thanks for the response. I am using Bioconductor packages now.
I would to confirm whether the following commands are correct for assigning the cdf file for normalization. On GEO, 3 platforms are given so I am a little confused. GEO : GSE1133
I told you in a previous post that there are three kinds of samples and this corresponds to three platforms, depending on the samples starting with
1B
,3A
andMG
. There are mouse and human data in this superset. Of course you can only compare samples from the same platform to each other. Be sure that you check which platform corresponds to the three sample groups by clicking the entries and then searching for theGPL
identifier.Thanks. Yes, I could find 3 platforms : GPL96, GPL1073 and GPL1074.
I could also get the corresponding gsets by doing
I am not sure how to proceed after the above step.
In the following procedure described in the supplementary file, RMA normalization is done before clustering.
From your response in the previous post, I understand log2 transformation has to be performed.
Could you please explain whether log2 transformation has to be performed using the normalized data?
Also, for performing
rma
normalization, which requires cfdname in the arguments that are passed, I am not sure how to specify the cdf files . For 3 platforms: GPL96 - cds file can be loaded using biocLite("hgu133acdf")Although I could locate the cds files of the following 2 platforms on GEO , custom downloads(GSE1133_RAW.tar), I don't know how to load the cdf files from biocLite.
GPL1073 GPL1074.