Regarding Normalization Of Rna-Seq Data: Use People Total Number Of Reads Per Lane Or Total Number Of Mapped Reads Per Lane?
1
1
Entering edit mode
13.1 years ago
Steffi ▴ 580

As I am not sure that all people are aware of the difference...

rna normalization • 4.3k views
ADD COMMENT
1
Entering edit mode
13.1 years ago
Ido Tamir 5.2k

I think every sane person is aware of the difference. RPKM were defined in Mortazavi as: "transcript levels in reads per kilobase of exon model per million mapped reads (RPKM)". It does not make sense to use the total number of reads. This it the most simple way to normalize RNA-Seq data. More advanced methods are being developed (eg. cqn and references therein - not used yet by myself).

It does not make any sense to use the number of reads that come out of the machine because this includes artifacts (adapter dimers), reads of low quality with errors that don't map to the genome etc....

I would also take care of reads (e.g. rRNA) that vary between preparations and can be mapped to the genome uniquely and the effect this has on RPKM values.

ADD COMMENT
0
Entering edit mode

Agree. It does not make any sense to normalize per number of reads coming from a lane (same for fractions of that if that is you do multiplexing). Normalize by number of mapped reads or use more sophisticated methods like RPKM.

ADD REPLY
0
Entering edit mode

I am not so sure that normalizing by total number of mapped reads is that sensible. What you really want to do is to account for different sequencing depths in multiple samples. Normalizing by total number of mapped reads implies that one biases the sequencing depth estimate by the mapping choice.

I am not happy with normalizing by total number of reads either. I quite like the approach of DESeq where the use the idea of a virtual reference sample.

ADD REPLY
0
Entering edit mode

I did not say its sensible but the most simple one. the linked paper gives references to other methods.

ADD REPLY

Login before adding your answer.

Traffic: 2551 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6