Hello, I am new at RNA-Seq data analysis and I want to analyze the data and do some analyses such as finding differentially expressed genes. My data set is from NCBI GEO and coded as GSE80336. Link is here:
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE80336
I see data is not normalized. How can I download the data from GEO with R, normalize it and make analyses? Is there a good pipeline for that?
Also there is a file called "Counts.txt" in supplementary. What is this file actually and can I use it?
Thank you.
Not sure about downloading data with R, but you can download the raw sequence reads with the fastq-dump command from the SRA Toolkit. Have a read of the following workflow for analysing RNA-seq data with R and Bioconductor: https://f1000research.com/articles/4-1070/v2