Weird MAplot with DEGs lying along the diagonals
1
0
Entering edit mode
3.8 years ago
FedeXandeR ▴ 20

Dear BioStarrers, I'm currently reprocessing some (not so) old microarray data originally acquired using the following platform: Agilent-026652 Whole Human Genome Microarray 4x44K v2 comparing a tumor sample and the matched "healthy" region nearby, in the same biopsy.

I'm interested in differential gene expression, as usual, but I get really weird MAplots (and quite compressed BoxPlots) in the preprocessing step. Here is an example:

BoxPlots of Raw data https://ibb.co/LN1rptj

MAplot of Raw data for one tumor sample vs its matched healthy sample https://ibb.co/n336Kv6

BoxPlots of Background-subtracted data https://ibb.co/DK52MFT

MAplot of Background-subtracted data for one tumor sample vs its matched healthy sample https://ibb.co/3R0PzxD

BoxPlots of Quantile-Normalized data https://ibb.co/mb3KZxK

MAplot of Quantile-Normalized data for one tumor sample vs its matched healthy sample https://ibb.co/271T7xw

It seems like the typical bulk of the highly-expressed-unchanged genes is completely missing, while all the DEGs have been "pushed" towards the two diagonals of the plot... Never seen something like this before... Do you think this is biologically reliable? Maybe some problem with hybridization or scanning?

Any help/suggestion/comment from anyone more experienced than me is welcome!

Thank you so much!

I think the problem is in the biology or in the wet-lab step, and not in the coding. However, here is the (very essential and almost standard) code I used:

library(limma)
targets = readTargets("Targets.txt", row.names = "SampleNumber")
raw = read.maimages(targets, source = 'agilent.median', green.only = TRUE)
   # Here I generate some QC plots of Raw data
raw_BGcorrected = backgroundCorrect(raw, method = "normexp", offset = 50)
   # Here I generate some QC plots of BG-subtracted data
raw_BGandNormalized = normalizeBetweenArrays(raw_BGcorrected, method = "quantile")
   # Here I generate some QC plots of Normalized data
microarray maplot normalization agilent • 1.6k views
ADD COMMENT
0
Entering edit mode
ADD REPLY
0
Entering edit mode

Sorry, I post so rarely that I didn't know cross-posting was an issue... do you think I should remove one of the two?

ADD REPLY
0
Entering edit mode

It's fine, we just want folks to be able to find the answer if it is answered elsewhere. If you get an answer there before here, please add the answer here.

ADD REPLY
0
Entering edit mode

Oh, ok, got it! I'll do it. Thank you!

ADD REPLY
2
Entering edit mode
3.8 years ago
seidel 11k

Your unnormalized expression plots have 50% of your data at only 64 counts. That seems really low. If you examine histograms of the background and foreground signal in each channel, do they look different? Do you have a data set based on a successful hybridization that you can use for reference, for what distributions of expected values might look like? Also, Agilent has various hybridization controls that you can look up - have you checked out the values of those?

ADD COMMENT
2
Entering edit mode

Thank you for your accurate advice. Actually I suspect a general failure of chip hybridization, as also suggested by Gordon Smyth on Bioconductor Forum where I cross-posted the same issue: Bioconductor cross-post. By the way, the analysis of Agilent control probes seems to confirm that. Thank you again bye

ADD REPLY

Login before adding your answer.

Traffic: 2533 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6