Let's say I have a deseq results like this.
baseMeanA baseMeanB
3479.11063850396 20.4138377244996
9918.86007582945 369.393254062374
1209.76762592568 59.2973381521179
948.474278149218 9009.30704907915
1341.44301378154 127.34346390045
2394.84611662839 326.621403591993
769.478047782662 4793.36351521464
2817.64755732181 427.718504703801
1808.47915508278 266.351977929185
... ....
This was generated from
baseMeanA = the list of integer counts in A / sizeFactorA
baseMeanB = the list of integer counts in B / sizeFactorB
Just by Iist of normalized values, is there a way to infer(reverse engineer) the sizeFactorA and sizeFactorB used to get the normalized values?
My goal is to recover raw counts from this.
Are you interested this in an academic sense? I mean -- are you just trying to find a clever way to solve the riddle, or are you constrained by not having the original data? I ask because you can simply use the
sizeFactors()
function on your originalCountDataSet
(DESeq) orDESeqDataSet
(DESeq2) to get these numbers.