Retrieving data from NCBI GEO and RNA-Seq Data Analysis
7.3 years ago
hkarakurt ▴ 190

Hello, I am new to RNA-Seq data analysis and I want to analyze a data set, for example to find differentially expressed genes. The data set is from NCBI GEO, accession GSE80336. The link is here:

https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE80336

The data do not appear to be normalized. How can I download the data from GEO with R, normalize it, and analyze it? Is there a good pipeline for that?

There is also a file called "Counts.txt" among the supplementary files. What exactly is this file, and can I use it?

Thank you.

RNA-Seq R GEO Normalization Count • 12k views

Not sure about downloading data with R, but you can download the raw sequence reads with the fastq-dump command from the SRA Toolkit. Have a read of the following workflow for analysing RNA-seq data with R and Bioconductor: https://f1000research.com/articles/4-1070/v2

7.3 years ago
theobroma22 ★ 1.2k

You can use the Bioconductor GEOquery package to retrieve and download data sets and platforms in R. RNA-seq data can be normalized in a few different ways, so check out the Bioconductor limma package. The counts file is most likely just that: the number of reads counted for each gene in each sample. You can certainly use it, but whether you should use it for your analysis is a different question.
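As a rough illustration of one common normalization route, the sketch below runs TMM normalization from edgeR followed by limma's voom on a gene-by-sample count matrix. The count matrix is simulated toy data and the two groups are hypothetical; neither comes from GSE80336. Treat it as a starting point under those assumptions, not as the required pipeline.

```r
library(edgeR)  # edgeR depends on limma, so limma is attached as well

# Toy stand-in for a real counts table: 200 genes x 6 samples.
# Simulated values, NOT data from GSE80336.
set.seed(1)
counts <- matrix(rnbinom(200 * 6, mu = 100, size = 5), nrow = 200,
                 dimnames = list(paste0("gene", 1:200), paste0("sample", 1:6)))

# Hypothetical experimental groups, three samples each
group <- factor(rep(c("control", "treated"), each = 3))

# Wrap the counts in a DGEList and drop genes with too few reads
y <- DGEList(counts = counts, group = group)
keep <- filterByExpr(y)
y <- y[keep, , keep.lib.sizes = FALSE]

# Compute TMM normalization factors (the default method)
y <- calcNormFactors(y)

# voom converts counts to log2-CPM and estimates precision weights
design <- model.matrix(~ group)
v <- voom(y, design)

# Fit the linear model and rank genes for the treated-vs-control coefficient
fit <- eBayes(lmFit(v, design))
results <- topTable(fit, coef = 2, number = Inf)
head(results)
```

With a real counts table, `counts` and `group` would be replaced by the downloaded matrix and the sample annotation of the series.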


I tried to download it with the getGEO() function, but the expression matrix, which I extracted with exprs(), is empty. I am not sure how to download the non-normalized data.
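For sequencing series, the series matrix that getGEO() returns often carries only the sample annotation, which would explain an empty exprs() result; the counts then live in the supplementary files. A minimal sketch of fetching them with GEOquery follows. The "Counts" file-name pattern is an assumption based on the "Counts.txt" name mentioned above, and the layout of the table is unknown here, so inspect it before using it.

```r
library(GEOquery)

# Download all supplementary files of the series into ./GSE80336/
supp <- getGEOSuppFiles("GSE80336")

# The row names of the returned data frame are the local file paths
files <- rownames(supp)
print(files)

# Pick the counts file. The pattern is an assumption based on the
# "Counts.txt" name in the question; adjust it to what print(files) shows.
counts_file <- grep("Counts", files, value = TRUE, ignore.case = TRUE)[1]

# read.delim reads gzip-compressed text files transparently
counts <- read.delim(counts_file, check.names = FALSE)
str(counts)  # check which column holds the gene identifiers

# The sample annotation still comes from the series matrix
gse <- getGEO("GSE80336", GSEMatrix = TRUE)
pheno <- Biobase::pData(gse[[1]])
head(pheno)
```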


Can you post all of your code, not just the functions you used? That will tell me which files you are trying to get from GSE80336 and why the result is empty. Also post any errors you get.
