Question

How to construct GCN?

0

Entering edit mode

5.9 years ago

Arindam Ghosh ▴ 530

I have gene expression values for TEST and CONTROL samples from RNA-seq. I have determined the differentially expressed genes (DEG). Now I need to prepare a co-expression network of only the DEG. I did this by calculating the Pearson Correlation Coefficient (PCC) values using the corr.test() in R and then using a suitable threshold. I need to clarify a few things here. While calculating PCC should I consider gene expression values from both TEST and CONTROL? Is it logical to build separate networks for TEST and CONTROL?

RNA-Seq network gene coexpression network • 1.4k views

ADD COMMENT • link updated 5.9 years ago by Kevin Blighe 88k • written 5.9 years ago by Arindam Ghosh ▴ 530

score 1 · Answer 1 · 2018-12-25

1

Entering edit mode

5.9 years ago

Kevin Blighe 88k

Yes, it makes sense to construct separate networks for TEST and CONTROL and to then compare these [networks]. In fact, I believe this is preferable to just constructing a network on the entire dataset and then doing post-correlations on modules, as is done in WGCNA.

When comparing the networks, you can look at:

hub scores for each vertex (gene) in each network
vertex degrees
betweenness centrality
edge strengths between vertex pairs
module / community memberships

Please read: Gene co-expression analysis for functional classification and gene-disease predictions.

Kevin

ADD COMMENT • link 5.9 years ago by Kevin Blighe 88k

0

Entering edit mode

But constructing only network with gene expression for single condition will give how the genes are correlated between the samples of a single condition. It won't tell much if any two genes are correlated across conditions. Again, for any given condition the gene expression values should be same across samples/replicates, atleast theoretically. In such circumstances, the PCC ~ 0 if we try to calculate the gene-gene PCC using gene expression data from one condition.

ADD REPLY • link 5.9 years ago by Arindam Ghosh ▴ 530

1

Entering edit mode

...and what does it actually mean, biologically, if 2 genes are correlated "across conditions"? - there are different ways to interpret this definition. On the other hand what does it mean if geneX is correlated to geneY in TEST but not correlated in CONTROL? In both cases, it may mean absolutely nothing because it's just a correlation.

Look, don't think too much about network analysis - it will never help you to fully understand a disease mechanism. If you have been asked to do it by your PI, you can do it and tick it off the list [of things to do]. In network analysis, people always highlight very isolated examples as success stories, much like DTC (direct to consumer) genetic testing companies; however, most people with whom I have worked have become so worried about these analyses that they decided to not include them in results.

ADD REPLY • link 5.9 years ago by Kevin Blighe 88k