Hi All,
I am not sure, how many of you have faced this problem. I have a RPKM from a RNA-seq run that was performed using ribo-zero method and I have another set of RPKMs from RNA-seq run that was performed using smarter kit.
Now due to difference in method a direct comparison is not of utility, so how can I normalise or scale these values to compare them?
For one of the files I only have RPKMs and not the read counts, so it is a bit difficult.
Any ideas?
Thank you
I think Yes,
there would be batch effect and there are RPKM variability issues as discussed several times on this forum. Better to ignore RPKM and start working with read counts per million. you can go for quantile normalization to remove batch effect arising from different RNA-seq source.
For one of the samples I only have RPKMs and not the tags, I have updated this information in my question. Can suggestions now?
Am I correct in guessing that the thing you're interested in measuring is partitioned across the method batch-effect? If so, you might look into RUV-2, since you'll need to use control genes for normalization.
Yes, it is a difference in sample prep method for RNA-seq, what you are suggesting looks promising but it is for microarray? isn't it?
The same method applies. They have a later paper that describes the method in an RNAseq context.
you finally play with numbers which are expression levels (could be normalized by some endogenous control), so doesn't matter whether it comes from RNA-seq or microarray.
While this is true, it should be noted that the values derived from RNAseq aren't independent of each other (e.g, an increase in signal from gene A will lead to an apparent decrease in signal from gene B), which can affect how well some methods work. This is also part of the reason why RPKM stinks as a metric.