Working with different batches: when to apply normalization
0
0
Entering edit mode
3.2 years ago
mincej20 • 0

I have publicly available data that was collected in two batches. The data is a superseries combined with two subseries from the same study that analyzes the same biological questions (ALS vs control).

I'm aware that normalization and transformation is a must, but should I:
-Normalize and transform each batch separately, combine, and batch effect correct using ComBat
OR
-Combine the batches, normalize and transform, and then batch effect correct?

Thanks for any help

EDIT:
I've done some digging, and it looks like global quantile normalization can reduce meaningful biological variation across groups (in the above case, between ALS and control).
As a result, there are papers (open access links below) that recommend within-class quantile normalization to normalize data while still maintaining meaningful biological differences. The first paper linked shows that these methods can also reduce batch effects for batches derived from the same experimental project (combining data for meta-analysis is a different beast, and more complicated).

Hope this information helps!

  1. How to do quantile normalization correctly for gene expression data analyses - Nature Scientific Reports
  2. Smooth quantile normalization - Biostatistics
  3. quantro: a data-driven approach to guide the choice of an appropriate normalization method - Genome Biology
combat microarray batch transformation normalization • 626 views
ADD COMMENT

Login before adding your answer.

Traffic: 1620 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6