I have multiple raw data files which are tar archives. I downloaded them from GEO and opened on 7-Zip. Each one of these files contains many txt.gz files and/or CEL.gz files.
Example: The tar file - GSE49355_RAW.tar
The files it contains - GSM1197996_092302_HG-U133A_AF07130_1.CEL.gz
GSM1197997_092302_HG-U133A_AF07149_1.CEL.gz
GSM1198020_CGCLG_073003_HG-U133A_AF09317_1.CEL.gz
etc..
My goal is to get a raw RNA expression data, with the samples in the columns and genes in rows. I've been trying to read the tar files into R using the Untar() function and the getGEO() function, but all I get is a vector of the names of the files, I can't access the data that I want.
How can I read the tar files AND the files in it, in R ?