Question

Scaling Rpkm/Fpkm Values Between 0 And 1

0

Entering edit mode

10.9 years ago

skm770 ▴ 150

Rpkm and fpkm values vary a lot some times from 10^-6 to 10^6. I have seen people comparing methylation with RNA-seq. Since methylation usually lies between 0 and 1 people scale the RPKM between 0 and 1. I am unaware of how do people do that and was wondering if there any r packages available to do the same.

thanks

rpkm fpkm • 5.3k views

ADD COMMENT • link 10.9 years ago by skm770 ▴ 150

1

Entering edit mode

Bizar question... people compare methylation with rna-seq? Do you mean associate methylation with expression? The fact that methylation values range between 0 and 1 (not always, it depends on what metric you use) doesn't mean rna-seq data needs to be in the same range. Yes, rpkm values can vary a lot (both within and between samples). More importantly, they are not normally distributed but log-transformed rpkm (do you mean fpkm?) are so start with that. However, I think you should contact a local bioinformatician to help you out. If you still want to scale rpkm values have a look into the scale()-function of R. Its in the R base functionality so no need to install any packages.

ADD REPLY • link 10.9 years ago by Irsan ★ 7.8k

0

Entering edit mode

What I meant by comparison is they see how expression is changing with RNA-seq usual ways they do it is in using box-plot with methylation and RNA-seq to see general patterns and dividing genes according to the methylation levels 10%,20%...100%.

ADD REPLY • link 10.9 years ago by skm770 ▴ 150

0

Entering edit mode

Ok so I guess you have whole genome expression and methylation data and you want to know wich genes are epigenetically regulated? Why do you want to know that? If a gene is epigenetically regulated does that make it more interesting? I'm not just giving you a hard time, I'm trying to get things clear so you get maximum response on your question

ADD REPLY • link 10.9 years ago by Irsan ★ 7.8k

0

Entering edit mode

Yes we would like to see if a gene is epigentically regulated/not or in this particular case what is the pattern of expression and methylation.

ADD REPLY • link 10.9 years ago by skm770 ▴ 150

0

Entering edit mode

Then for each gene, perform a linear regression of methylation-values on expression values of your complete cohort. Make sure your expression values are normally distributed (log2(FPKM)). Do false discovery rate adjustment on the resulting p-values and filter on fdr-adjusted < 0.05 and with negative slope (meaning more methylation results in less expression). if you want, you can use these genes again to do hierarchical clustering.

ADD REPLY • link 10.9 years ago by Irsan ★ 7.8k

0

Entering edit mode

can you explain how to "for each gene, perform a linear regression of methylation-values on expression values of your complete cohort", I'm very interesting, thanks

ADD REPLY • link updated 4.9 years ago by Ram 44k • written 10.5 years ago by Ares Cao ▴ 20