I would like to conduct a gene expression meta-analysis between age and particular ageing-related phenotypes in human, mouse and rat.
To construct my data set, I only want to extract gene expression data from healthy controls in human, mouse and rat studies that have expression data at different ages/multiple timepoints (i.e. so that I have a set of normal gene expression levels for mouse, rat and human at different ages and can correlate these with various phenotypes).
When I search for "age" in the GEO DataSets browser (https://www.ncbi.nlm.nih.gov/sites/GDSbrowser) to identify suitable studies, there are 1,214 results.
However, if I select one result at random (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE51843), I can see that it is just a case-control expression study for cancer, with no age/time aspect to it. Clearly, I am not properly selecting the subset of studies that have expression data for controls of different ages.
How would you advise specifically extracting, from GEO, the data sets in human, mouse and rat that have gene expression information for healthy controls at different ages?
While you are at the mercy of the metadata submitted with the datasets, have you tried an advanced search with "attribute name" set to age (adding any additional terms that you need to refine it)?
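For reference, here is a minimal Python sketch of composing such a search string so it can be reused for each species. The field tags `[Attribute Name]`, `[Organism]` and `[Entry Type]` are assumed to match what the GEO DataSets advanced search builder generates, and the helper name `build_geo_query` is hypothetical; paste the output into the search box to confirm it behaves as expected before relying on it.

```python
def build_geo_query(attribute, organisms, entry_type="gds"):
    """Compose a GEO DataSets search string.

    The field tags used here are assumptions based on the advanced
    search builder; verify them against the GEO query documentation.
    """
    # One quoted clause per species, joined with OR
    organism_clause = " OR ".join(f'"{name}"[Organism]' for name in organisms)
    return (
        f"{attribute}[Attribute Name] "
        f"AND ({organism_clause}) "
        f"AND {entry_type}[Entry Type]"
    )


query = build_geo_query(
    "age", ["Homo sapiens", "Mus musculus", "Rattus norvegicus"]
)
print(query)
```

The search only narrows the candidates; it still depends on how each submitter annotated their samples.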
Thank you. Unfortunately, I do not think this will work. When I set the attribute name to age, one of the first results that I obtain is https://www.ncbi.nlm.nih.gov/sites/GDSbrowser?acc=GDS5649, which is just another case-control cancer study.
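What seems to be needed is a check on the sample annotations themselves, not just on whether an "age" attribute exists: a study would only qualify if its control samples cover several distinct ages. Below is a rough Python sketch of that check, assuming the per-sample characteristics have already been downloaded and parsed into dictionaries. The keys `disease_state` and `age`, the set of control labels, and the function names are all hypothetical, since submitters label these fields inconsistently.

```python
# Labels assumed to mark a healthy control sample (hypothetical list)
CONTROL_LABELS = {"control", "healthy", "normal"}


def control_ages(samples):
    """Return the set of distinct ages recorded for control samples."""
    ages = set()
    for sample in samples:
        state = sample.get("disease_state", "").strip().lower()
        age = sample.get("age")
        if state in CONTROL_LABELS and age is not None:
            ages.add(age)
    return ages


def has_age_series(samples, min_ages=3):
    """True if the controls span at least `min_ages` distinct ages."""
    return len(control_ages(samples)) >= min_ages


# Toy records for illustration only, not real GEO data
cancer_study = [
    {"disease_state": "tumor", "age": "60"},
    {"disease_state": "control", "age": "60"},
]
ageing_study = [
    {"disease_state": "control", "age": "3 months"},
    {"disease_state": "control", "age": "12 months"},
    {"disease_state": "control", "age": "24 months"},
]

print(has_age_series(cancer_study))
print(has_age_series(ageing_study))
```

A case-control study like the one above would be rejected because its controls share a single age, while a study sampling controls at several timepoints would be kept.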