Hi fellow biostars,
I have RPKM values for 20 samples (10 disease and 10 control). I'm trying to build a classification model, so I need to do between-sample normalization. I know that I could run DESeq or TMM normalization on raw read counts, but is it okay to run them on RPKM? Thanks a lot!
Update: I found that DESeq doesn't support continuous data like RPKM. So I guess my question is: are there any between-sample normalization methods that I could use on RPKM data? Thanks!
The problem is not the normalization itself: you can normalize RPKM values with the same method DESeq uses. The DESeq statistical model, however, assumes the input is raw read counts, and it's pretty useless without them. Can't you get the raw reads?
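
To illustrate what "the same method DESeq uses" could look like on RPKM, here is a minimal sketch of median-of-ratios size factors in plain Python. The function names (`size_factors`, `normalize`) and the toy matrix are made up for this example, and this is not the DESeq code itself, just the scaling idea applied to a genes-by-samples table of continuous values. Genes with a zero in any sample are skipped when estimating the factors.

```python
import math
from statistics import median


def size_factors(matrix):
    """Median-of-ratios size factors for a genes x samples matrix.

    `matrix` is a list of rows (one per gene), each a list of
    per-sample values such as RPKM. Hypothetical helper for illustration.
    """
    n_samples = len(matrix[0])
    # Use only genes that are positive in every sample, so logs are defined.
    log_rows = [[math.log(v) for v in row]
                for row in matrix if all(v > 0 for v in row)]
    if not log_rows:
        raise ValueError("no gene is positive in all samples")
    # Log of the per-gene geometric mean across samples.
    log_geo_means = [sum(row) / n_samples for row in log_rows]
    factors = []
    for j in range(n_samples):
        # Log ratio of this sample's value to the gene's geometric mean.
        log_ratios = [row[j] - g for row, g in zip(log_rows, log_geo_means)]
        factors.append(math.exp(median(log_ratios)))
    return factors


def normalize(matrix):
    """Divide each sample (column) by its size factor."""
    factors = size_factors(matrix)
    return [[v / f for v, f in zip(row, factors)] for row in matrix]


# Toy data (invented): sample 2 is roughly sample 1 scaled by 2.
rpkm = [
    [1.0, 2.0],
    [4.0, 8.0],
    [10.0, 20.0],
    [0.0, 3.0],   # has a zero, so it is ignored when estimating factors
]
factors = size_factors(rpkm)
normalized = normalize(rpkm)
```

On the toy data, the second sample's factor comes out twice the first sample's, so the two columns agree after scaling. This only handles between-sample scaling for something like a classifier input; it does not make RPKM suitable for the DESeq count model.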