Question

Differential peak analysis for ATAC-seq using transformed and normalized CPM matrix

0

Entering edit mode

2.4 years ago

mmitra ▴ 60

Hi all, I have a ATAC-seq data matrix where the rows are consensus peaks and each column represents either condition A or B. There are a total of 8 columns in the matrix: 4 samples for condition A and four samples for condition B. Each entry of the matrix is obtained by converting the raw count to CPM followed by log2 transformation and then performing the quantile normalization. I am wondering what would be best way to do the differential peak analysis between conditions A and B?

The differential expression (or peak) analysis programs I know (such as DESeq2) require raw counts as input and not the transformed and normalized data that I have. I did not generate this data and this is all I have.

Thanks a lot in advance for any suggestions. Any help would be greatly appreciated.

ATAC-seq differential normalization matrix analysis CPM • 1.2k views

ADD COMMENT • link 2.4 years ago by mmitra ▴ 60

score 4 · Accepted Answer · 2022-07-28

4

Entering edit mode

2.4 years ago

LChart 4.7k

The log2-CPM transformation is very close to limma's VOOM transform of raw counts; as such I would recommend using the limma package in R to perform the subsequent differential expression modeling.

ADD COMMENT • link 2.4 years ago by LChart 4.7k

2

Entering edit mode

Yes, limma-trend, so using trend=TRUE in the eBayes() (or treat() function. My usual word of caution though, if you do not have the actual raw data at hand your analysis is, strictly speaking, not reproducible. Also, in case you would find that normalization performance was not good, e.g. diagnosed by MA-plots after the DE analysis, then there is little to no way to change that since you do not have the raw counts. Your choice to go along with that or not.

ADD REPLY • link 2.4 years ago by ATpoint 86k

0

Entering edit mode

Thank you both for the suggestion. I will try limma as per your recommendation. I would like to get the actual raw data (i have to get the authorization), but all the ATAC-seq QC and analysis is published. They, however, did not do the differential peak analysis like the way I wanted to do.

ADD REPLY • link 2.4 years ago by mmitra ▴ 60