From GEO one can download SRR files (ending in .sra) of Illumina data, which can be extracted to FASTQ with fastq-dump. How can the sample information for these SRR IDs be read programmatically from the GEO/SRA metadata? For example, the project SRP005601 has a sample "SRR097786" which is not described anywhere in the SOFT/MINiML files, and those files are incredibly complicated. How can I find the information describing the sample label from GEO?
The only solution I have found is a manual one, through http://trace.ncbi.nlm.nih.gov/Traces/sra/sra.cgi?view=search_obj ("Search SRA objects") in NCBI Trace. I type in each SRR* ID by hand and then click around until I find the sample information, and I repeat this for every sample. This is a terrible manual solution, so I was hoping to download this metadata and parse it from a CSV file.
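
To make the goal concrete, here is a rough sketch of the parsing step I have in mind. It assumes I already have the metadata as CSV text with one row per run. The column names `Run` and `SampleName` are my assumption about what such a file would contain, the function name `sample_labels` is just illustrative, and the rows in the example are made-up placeholders, not real metadata for SRP005601.

```python
import csv
import io


def sample_labels(csv_text, run_col="Run", label_col="SampleName"):
    """Map each run accession to its sample label.

    run_col and label_col are assumed column names; adjust them to
    whatever the downloaded metadata file actually uses.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    labels = {}
    for row in reader:
        run = (row.get(run_col) or "").strip()
        if run:  # skip blank or malformed rows
            labels[run] = (row.get(label_col) or "").strip()
    return labels


# Placeholder rows for illustration only (labels and the second ID are made up).
example = (
    "Run,SampleName\n"
    "SRR097786,placeholder_label_A\n"
    "SRR0000000,placeholder_label_B\n"
)

print(sample_labels(example)["SRR097786"])
```

What I am missing is the first half: a way to get such a table for a whole project without clicking through the web interface.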