This is a follow-up to my post over here
I'm collecting the gene expression data from microarray studies. How do we filter the studies performed using a specific platform, e.g. Affymetrix Human Genome U133 Plus 2.0 Array?
Update: How do we filter the data from GSE files? I found a tutorial that shows how to parse from GSE files. But what I could get is the probe ids and the expression values for each sample. For instance, from GDS one could use Table(gds)/Column(gds) to filter the gene synmbols and the sample description. For GSE, it is mentioned that object class is not available. Could someone help me in filtering the gene symbols and sample description from the expression set that is created using GSE data?
Hello Kevin, Thanks a lot for the response. I'm parsing the expression values from GDS files in Bioconductor following the tutorial given here. Most of the studies from the above search have reported GSE files. Could you please suggest whether there are tutorials on how to parse data from GSE files in Bioconductor?
Edit: I found a tutorial that shows how to parse from GSE files. But what I could get is the probe ids and the expression values for each sample. For instance from GDS one could use Table(gds)/Column(gds) to filter the gene synmbols and the sample description. For GSE, it is mentioned that object class is not available. Could someone help me in filtering the gene symbols and sample description from the expression set that is created using GSE data?
Hey Natasha, so, you can obtain some data but the issue is that the IDs are Affy probe IDs? If that's the case, then there are ways to convert these to gene symbols, here:
Hi Kevin, Yes, these IDs are the Affy probe IDs. Thanks for the informative links.
Could you please suggest how to get the sample descriptions from the expression set created using GSE files.?
For instance , using the eset of GDS files I obtain, pData(eset)
Likewise, is there any syntax that can be used to find the cell.type / sample description?
I tried pData(eset) for GSE after creating the eset following the instructions given here, it didn't work though.
Oh, I just tried it for that dataset, and it works like this:
The ExpressionSet is stored as
gse[[1]]
: