Entering edit mode
6.6 years ago
farzaneafzali
▴
80
Hi everybody I have a problem with analyzing micro array data. When i try to do log2 transformation to make "min and max" range normal, it gives me Nan Value so i can not continue the analysis. The codes i use are:
ex<-exprs(gset)
dim(ex)
ex<- log2(ex+1)
exprs(gset) <- ex
min(ex)
max(ex)
** when i add a bigger value to ex (like ex+5) it doesn't give me Nan value anymore. is it wise to do that? If not, Plz help me to get rid of Nan Value . Tnx a lot
P.S= I don't have a very high skill in analyzing and R programming so if u can give me a code that can solve my problem i really appreciate it.
what is the range of the exprs matrix before you apply log-transformation?
You should not encounter this issue with microarray data because the QC and normalisation methods for microarray analyses are very 'mature' and stable. Please show your code that tells us how you created the gset object - the data in
exprs(gset)
may already be quantile normalised and transformed by log base 2.if you do
hist(exprs(gset))
, how does the result look? - paste the image here via imgbb.Thank you!
شكرا
Hi Kevin thanks for your reply! this is my code for creating gset object in GSE29226 dataset.
It's not normalized yet because it gives me
.
It looks like that data may already be normalised, if obtained this way:
Thanks a lot Kevin. I thought i should check min(ex) and max(ex) to be sure about log2 transformation. I didn't know that i can draw log plot. By the way, do you know what's the benefit of knowing min and max(ex)? Because in my analysis they are -21.7958 and 37248.49 and seems to be high...
Knowing min and max is not too relevant in this regard, in my opinion. Min and max values will differed a lot across different microarray types and versions.
I believe that such a range is typical for this array type, i.e., the Illumina HT Expression BeadChip. For your study of interest, the true raw data is accessible via a link at the bottom of the GEO Accession record:
[source: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE29226]
The data that you obtain via my code is the unlogged, normalised counts, from what I understand. You should confirm this with the authors of the study.
I didn't pay attention to supplementary data. Thanks a lot again dear Kevin for your help.
Hi dear rush. These are info you wanted in GSE29226.
[1] 48803 24
[1] -35.7876
[1] 59600.26