Question

DiffBind Normalization ATAC-Seq Question: When to drop a sample?

0

Entering edit mode

7 weeks ago

John • 0

The first image with less Differntially Accessible Regions (DARs) was generated by excluding a sample that had an FDR of 0.11 and the second image with more DARs is just not excluding that low FDR sample.

My question is can I get away with using that second image even though the normalization line has a hump in it or is that a sign that I should throw out that sample? I am trying to avoid this as it significantly decreases the number of reads I have to work with and decreases the number of DARs by about 65%.

-sample Excluding a sample that had an FDR of 0.11

VS

+sample Not excluding that sample

ATAC-Seq DiffBind • 567 views

ADD COMMENT • link 6 weeks ago by John • 0

score 0 · Answer 1 · 2025-03-09

0

Entering edit mode

6 weeks ago

ATpoint 87k

Usually, you perform some exploration like PCA and plotting metrics like sample2sample correlation, sequencing depth etc. to see whether a sample is an outlier that merits removal. From the MA-plots you see the lower one has notably larger fold changes and some sort of non-linear bias. That itself could be corrected by something like loess normalization e.g. from the csaw package. Would need to see more plots to decide on removal of samples.

ADD COMMENT • link 6 weeks ago by ATpoint 87k

0

Entering edit mode

The PCA plots were kind of a mess. I excluded the sample FOX4, which made the normalization linear but sacrificed many DARs.

enter image description here

ADD REPLY • link 6 weeks ago by John • 0

0

Entering edit mode

which made the normalization linear but sacrificed many DERs.

Frankly, I dont think that the "DERs" in the lower plot a real but a normalization artifact. The upper pink points should be much lower.

ADD REPLY • link 6 weeks ago by ATpoint 87k

0

Entering edit mode

Thank you for the advice. I went back and re-normalized using a different method. I used all methods, which I thought applied the loess method, but I am unsure. However, it seems to have helped flatten the curve, although I am unsure of what method was responsible. Found the method here: https://support.bioconductor.org/p/9152726/#9152964

atacV2Loes <- dba.normalize(atacV1Loes, method = DBA_ALL_METHODS, offsets = TRUE)

enter image description here

ADD REPLY • link 6 weeks ago by John • 0

0

Entering edit mode

I did a little bit of digging and found that I used Deseq2 to normalize.

ADD REPLY • link 6 weeks ago by John • 0