Rpkm In Masked Sequences
1
0
Entering edit mode
11.8 years ago
Leandro Lima ▴ 970

Hello.

I'm confused about calculating RPKM when the target sequence is masked.

In this post, they discuss about the use of "N", in

rpkm=10^9*C/NL, where C is the reads number of the transcript, L is the length of the transcript and N is the total reads count of the sample.

http://seqanswers.com/forums/showthread.php?t=7630

And what about L? Does it change, when the sequence is masked?

Thank you.

rpkm • 2.3k views
ADD COMMENT
1
Entering edit mode
11.8 years ago

If you mapped the reads to the masked sequence, then I would use the masked length to calculate the RPKM.

The rationale is that you are effectively removing reads that would have mapped to that masked region, so you should also remove the length of the masked region when normalizing by length.

ADD COMMENT

Login before adding your answer.

Traffic: 2550 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6