DiffBind Normalization ATAC-Seq Question: When to drop a sample?
1
0
Entering edit mode
7 weeks ago
John • 0

The first image with less Differntially Accessible Regions (DARs) was generated by excluding a sample that had an FDR of 0.11 and the second image with more DARs is just not excluding that low FDR sample.

My question is can I get away with using that second image even though the normalization line has a hump in it or is that a sign that I should throw out that sample? I am trying to avoid this as it significantly decreases the number of reads I have to work with and decreases the number of DARs by about 65%.

-sample Excluding a sample that had an FDR of 0.11

VS

+sample Not excluding that sample

ATAC-Seq DiffBind • 567 views
ADD COMMENT
0
Entering edit mode
6 weeks ago
ATpoint 87k

Usually, you perform some exploration like PCA and plotting metrics like sample2sample correlation, sequencing depth etc. to see whether a sample is an outlier that merits removal. From the MA-plots you see the lower one has notably larger fold changes and some sort of non-linear bias. That itself could be corrected by something like loess normalization e.g. from the csaw package. Would need to see more plots to decide on removal of samples.

ADD COMMENT
0
Entering edit mode

The PCA plots were kind of a mess. I excluded the sample FOX4, which made the normalization linear but sacrificed many DARs.

enter image description here

ADD REPLY
0
Entering edit mode

which made the normalization linear but sacrificed many DERs.

Frankly, I dont think that the "DERs" in the lower plot a real but a normalization artifact. The upper pink points should be much lower.

ADD REPLY
0
Entering edit mode

Thank you for the advice. I went back and re-normalized using a different method. I used all methods, which I thought applied the loess method, but I am unsure. However, it seems to have helped flatten the curve, although I am unsure of what method was responsible. Found the method here: https://support.bioconductor.org/p/9152726/#9152964

atacV2Loes <- dba.normalize(atacV1Loes, method = DBA_ALL_METHODS, offsets = TRUE)

enter image description here

ADD REPLY
0
Entering edit mode

I did a little bit of digging and found that I used Deseq2 to normalize.

ADD REPLY

Login before adding your answer.

Traffic: 1756 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6