Question

How to interpret this plotMDS of three disease clusters?

0

Entering edit mode

10 months ago

egascon ▴ 60

Hello,

I have been analysing the dataset GSE140829, which is made up of three groups in the diagnostic variable: AD, MCI and Control.

I have done a previous quality analysis with a plotMDS (after normalising the data) to see how each of the groups behave and to see their distribution.

enter image description here

As you can see in the image, the three groups are intermingled.

I am currently learning. My doubts are:

1) How should I interpret this, can I say that there are practically no differences or outliers?

2) Should I apply any corrections when doing the lmfit analysis? For example, arrayWeights()

3) Does the size of the groups influence the plotMDS? There are more than 600 samples.

Thank you very much for your help.

microarray plotMDS DEGs • 557 views

ADD COMMENT • link updated 10 months ago by ATpoint 87k • written 10 months ago by egascon ▴ 60

2

Entering edit mode

arrayWeights is imo always a good idea with human (or generally large) cohorts. What you can also do is to use something like sva to estimate factors of unwanted variation. Never done this for microarray, but for RNA-seq here is a great read from the DESeq2 author: https://github.com/mikelove/preNivolumabOnNivolumab/blob/main/preNivolumabOnNivolumab.knit.md

Generally, you can be almost certain that human cohorts are confounded by a lot of factors, including age, dietary status, medication, disease status beyond the actual disease/condition you're investigating, so strict univariate analysis needs either a very clear effect, and/or a large n.

ADD REPLY • link 10 months ago by ATpoint 87k

score 1 · Answer 1 · 2024-05-23

1) How should I interpret this, can I say that there are practically no differences or outliers?

You can say that the inter-group variance is not the strongest source of variance in this experiment. But the lead leading dimensions only account for 31% of the variation, leaving nearly 69% still to be explained. So you have no grounds to say that there are "practically no differences".

I don't see any outliers that would warnent any comment.

2) Should I apply any corrections when doing the lmfit analysis? For example, arrayWeights()

I don't see any grounds for that from this evdience alone.

3) Does the size of the groups influence the plotMDS? There are more than 600 samples.

If some groups are larger than others, then the that will partially bias the MDS towards the variance structures within the larger groups.

How to interpret this plotMDS of three disease clusters?

I tend to recommend not reading too much into plots like this, unless there is something really obvious in it. You might be able to say that the differences between indeivduals within the goups are larger than the differences between the groups, but that doesn't mean there arn't any differences in the group means.