Hello,
Please help me with understanding my boxplot.
Here is the boxplot, I got for my RNA-Seq data.
My data is
head(rawCountTable)
con-1 con-2 a-1 a-2 b-1 b-2 c-1 c-2 d-1 d-2
ENSMUSG0000000000 0 0 0 0 0 0 0 0 0 0
ENSMUSG00000000028 854 937 1143 1029 912 856 809 754 513 520
ENSMUSG00000000031 822918 817451 716860 691396 763705 829274 838094 819312 717935 730879
The code for Boxplot is below:
pseudoCount = log2(rawCountTable + 1)
df = melt(pseudoCount, variable.name = "Samples",
value.name = "count") # reshape the matrix
df = data.frame(df, Condition = substr(df$Samples, 1, 4))
Here is my code for the density plot.
ggplot(df, aes(x = count, colour = Samples, fill = Samples)) +
ylim(c(0, 0.17)) +
geom_density(alpha = 0.2, size = 1.25) +
facet_wrap(~ Condition) + theme(legend.position = "top") +
xlab(expression(log[2](count + 1)))
The density Plot is
So, my question is I want to know how to interpret these plots? How is my data quality? If you can recommend me an article about understanding these plots and assess my data, I would appreciate it.
Thank you very much!
The image links are broken. Try hosting and embedding them by pressing the image button in the post.
I've fixed it. OP used the embed code in image direct link field.