Question

Beginner question: box-plot and PCA in RNA-seq

0

Entering edit mode

9.8 years ago

scchess ▴ 640

This is a beginner question.

I have drawn up a box-plot and PCA plot for my RNA-seq experiment before and after normalization. However, I don't understand how to interpret the plots. For example, in the PCA plot, are the biological replicates expected to cluster by the first principal component? What about cluster on the second principal component? If I see my samples are clustered in a box-plot, does that mean my normalization has done the job?

What does a perfect normalization would look like on a box-plot and PCA plot? Any online resources?

RNA-Seq • 4.4k views

ADD COMMENT • link updated 3.1 years ago by Ram 45k • written 9.8 years ago by scchess ▴ 640

Ram · Answer 1 · 2015-11-17

2

Entering edit mode

9.8 years ago

Devon Ryan 105k

I'm not entirely sure what you're plotting in your box plot (normalized counts?), so I won't comment on that.

When you plot the PCA results you don't necessarily expect the biological replicates to cluster by group (either in PC1 or PC2), that depends entirely on the effect size and the underlying biological noise. The general use of a PCA plot is to ensure that you don't have any unknown batch effects causing outliers or really weird clustering of samples.

ADD COMMENT • link 9.8 years ago by Devon Ryan 105k

0

Entering edit mode

Thanks for the explanation. I made a box-plot and PCA both before and after normalization. It was done on the raw and normalized counts. Does that mean technical replicates are expected to cluster by the first or second component in a PCA plot? I'm trying to understand how to read the plots.

ADD REPLY • link 9.8 years ago by scchess ▴ 640

0

Entering edit mode

If you have technical replicates then they should always cluster together after normalization. If not then something went wrong (not in normalization, but in making one of the replicates).

ADD REPLY • link updated 5.8 years ago by Ram 45k • written 9.8 years ago by Devon Ryan 105k