Filtering genes after TPM normalization
1
0
Entering edit mode
17 months ago
JACKY ▴ 160

Filtering genes is one of the steps that I do in all my analysis. I usually filter the counts RNA-seq data according to a specific threshold. In my current work, I have no access to the counts data, I only get the normalized data. All data is normalized with TPM approach.

I'm wondering if it is still a good idea to filter genes even though the data is normalized? if so, what would be a good approach for that? since almost every package in R filters counts data, and not TPM data.

Thank you!

filtering normalization TPM r • 898 views
ADD COMMENT
1
Entering edit mode
17 months ago
Meisam ▴ 250

While not a fan of it, I’ve seen papers where authors used a cpm or tpm value cutoff of 1 to filter their count table. Since this threshold would be anyhow arbitrary, you might want to use your best guess for what roughly the total reads per sample could be. Like if you know they used a MiSeq machine and you have three samples you would expect to have around 3-5 million reads per sample before normalization.

ADD COMMENT

Login before adding your answer.

Traffic: 2759 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6