Is plotMDS (from limma package) a PCA (Principal Component Analysis) or a PCoA (Principal Coordinate Analysis)?
1
2
Entering edit mode
4.6 years ago
harelarik ▴ 90

A. Is the plotMDS function (from the limma package) based on PCA (Principal Component Analysis) or a PCoA (Principal Coordinate Analysis)?

B. What the values on the plot represent? Is it root-mean-square average or root-mean-square deviation (see below)?

  • In the limma paper published in Nucleic Acids Res at 2015 (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4402510/) it is stated that: "Distances between samples on the plot represent ‘leading fold change’, which is defined as the root-mean-square average* of the log-fold-changes for the genes best distinguishing each pair of samples. "

  • In the R manual it is stated that: "The distance between each pair of samples (columns) is the root-mean-square deviation (Euclidean distance) for the top ‘top’ genes. Distances on the plot can be interpreted as _leading log2-fold-change_, meaning the typical (root-mean-square) log2-fold-change between the samples for the genes that distinguish those samples."

Thank you,

Arik

limma PCA plotMDS RNA-Seq R • 7.4k views
ADD COMMENT
0
Entering edit mode

This video may be helpful for a basic understanding on this topic.

ADD REPLY
0
Entering edit mode

Sorry but the link does not work.

ADD REPLY
0
Entering edit mode

Should work now.

ADD REPLY
1
Entering edit mode
4.6 years ago
Gordon Smyth ★ 7.7k

A. The plotMDS documentation help(plotMDS) explains in detail that's doing multidimensional scaling (MDS) and PCoA is just a synonym for MDS. PCA is a special case of PCoA so in some cases plotMDS is also doing PCA. If the axis labels says "Principal component" then it is PCA (and PCoA). If the axis label says "leading logFC" then it's just PCoA.

B. Both expressions mean the same thing, it's the root-mean-square of the largest logFCs: sqrt(mean(logFC^2)).

ADD COMMENT
0
Entering edit mode

Thank you very much! A. How can I make the function generate PCA? So the axis label will show "Principal component".

Arik

ADD REPLY
0
Entering edit mode

gene.selection="common"

ADD REPLY
0
Entering edit mode

Thank you very much!

As common option will "select the same genes for all comparisons" when would you recommend to use it?

ADD REPLY
0
Entering edit mode

I recommend you use the plotMDS default settings, unless your dataset is very large (hundreds or thousands of samples) in which case gene.selection="common" is much faster.

ADD REPLY
0
Entering edit mode

Dear Dr. Smyth,

Thank you very much!

Arik

ADD REPLY

Login before adding your answer.

Traffic: 1978 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6