Hi
I downloaded the RNA Seq data from the GEO (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSM3998167). I want to perform the DEG. The each txt file has around 66000 hits. The data processing information given is as under (Reads were cropped to 75bp using Trimmomatic Reads were mapped to GRCh38 (ensembl release 78) using STAR (v2.4.2a) Gene count tables were produced using HTSeq-count (v0.11.2) with -r pos -s reverse Genome_build: GRCh38, Ensembl release 78 Supplementary_files_format_and_content: txt files with gene-level counts).
My question is whether the data given is normalized or not. For my analysis i just log2 transform this data and perform further analysis. Is my approach right or I need to change it. Please suggest...
Regards
If you're starting with a count table (read counts to genes) the data is not normalized.
What does your analysis consist of?
Thanks for the reply
Differential expression, Correlation, multiple analysis