I want to analyze a chip-seq data set in GEO (GSE11431), and I have two questions:
Its SRA file in each of GSM have multiple files/"runs", what does it mean? is it replicates or what? Do I just download all of them and run bowtie individually using same reference genome, and then combine them afterward?
Another question is where can I download mouse reference gene (mm9 version) for bowtie?
Thank you very much for any idea and help!!
What do you mean by "multiple "runs" will be merged at some point in the analysis pipeline"? How can I merge them?
You can merge the raw fastq files before performing any analysis or you can merge the bam files after alignment. It depends on the type of analysis you are doing. Merging after alignment would be good as you can compare the differences between multiple runs after alignment and then merge.