Entering edit mode
9.3 years ago
Ron
★
1.2k
Hi all,
I want to do RANK based normalization in RNA seq expression data,basically ranking genes in a sample according to FPKMS and dividing by the total number of genes to get a normalized expression value between 0 and 1.
This has to be done across multiple samples.Is there any method to do this?
I have used this method,seems to work,but want to look at rank based method.
gene_min=apply(df, 1, min)
gene_max=apply(df, 1, max)
df_norm=(df-gene_min)/(gene_max-gene_min)
Thanks,
Ron
Hi Ron, what have you tried. A very simple method will be to apply a small filter (threshold) on the FPKM values, scale the genes between 0-1 and then just sort, very easy to do in R. Let us know, if you get stuck, I can write a function for you.
I have added the method I am using,which is a bit different although it scales values between 0 and 1.