Hey everyone!
I hope that question is allowed as it is on the intersection of wet-lab and dry-lab.
We have prepared Illumina libraries with the RiboZero kit, however we noticed afterwards that the normalization was messed up so different amounts of total RNA went into the library prep. Effectifely, we added between 20 ng and 500 ng of total RNA as input to the library preparation due to the miscalculation.
I am wondering if there will be an observable effect on the low amount samples when we perform differential gene expression analysis between the samples? Will we for example recover less genes? We are of course going to normalize the libraries and sequence them to the same depth. I think that even 20ng should be enough to capture the diversity of the transcriptome, but I was hoping someone had maybe experience with lower than 100ng input samples.
Thanks a lot in advance.
Intuitively I would argue that it will not make a notable difference but the library with the 20ng will require slightly more PCR amplification and might therefore suffer a bit more from PCR bias. You can of course after normalization do quality control via e.g. PCA or MDS plots and then see if there is a sign of batch effect, but I would not expect it especially if RNA was of high quality (RIN) as the kits cover a wide range of inut amounts. I cannot really argue for RNA-seq but for ATAC-seq where I made library from the same cell populations of different donors ranging from roughly 10.000 to 50.000 cells with no sign of cell-number-based batch effect at all. This might happen if you process the exact same specimen with different input amounts but I assume that the biological variation will dominate these slight differences in library prep. Still, I have no hard proof for my statement towards RNA-seq.
Thanks for the input! Even without proof it is at least comforting to read :)