I followed https://www.bioconductor.org/help/workflows/RNAseq123/ for my rnaseq analysis. As my read counts were around 2.5 million, I had to use higher CPM. Hope this should be fine for downstream analysis.
After following command:
par(mfrow=c(1,2))
lcpm <- cpm(y2, log=TRUE)
boxplot(lcpm, las=2, col=group$Sample, main="")
title(main="A. Example: Unnormalised data at CPM-2",ylab="Log-cpm")
y2 <- calcNormFactors(y2)
y2$samples$norm.factors
lcpm <- cpm(y, log=TRUE)
boxplot(lcpm, las=2, col=group$Sample, main="")
title(main="B. Example: Normalised data at CPM-2",ylab="Log-cpm")
I got following boxplot graph. ![enter image description here][1] [1]: https://ibb.co/gWW2r6
Based on normalized data, which samples should I remove from analysis ?