Filtering during RNA Seq analysis?
2
0
Entering edit mode
8.3 years ago

Hi,

I have roughly 100 samples (2 groups) of noisy human tissue data. Thinking about filtering to improve the multiple testing situation. Thus far, I filter by cpm values (calculated using edgeR after counting with featurecounts), and run voom/limma afterwards. I have also stumbled over htsfilter, and thought about variance filtering.

Now I heard that filtering, at least by variance or htsfilter, was per se incompatible with limma or other empirial Bayes methods.

What do you experts think/suggest?

Many thanks!

RNA-Seq • 3.2k views
ADD COMMENT
4
Entering edit mode
8.3 years ago

You might also try "Independent Hypothesis Weighting" as implemented in the IHW Bioconductor package and described it this paper: http://www.ncbi.nlm.nih.gov/pubmed/27240256

Instead of filtering genes to conserve power, it weights them. It is compatible at least with DESeq2 and is used in their vignette (i suspect it can be used in conjunction with other packages as well).

ADD COMMENT
3
Entering edit mode
8.3 years ago

The package you're looking for is genefilter. As an aside, that's used automatically by DESeq2.

BTW, filtering by variance isn't compatible with any RNAseq package that I'm aware of (variance is convoluted with the DE test).

ADD COMMENT
0
Entering edit mode

Thanks guys for your suggestions. I will definitely also try DESeq2. I have the feeling that it is quite en vogue at the moment, is that correct? Thus far, I primarily use limma, because I was brought up with arrays, and usually need to correct for random and fixed factors.

ADD REPLY

Login before adding your answer.

Traffic: 1909 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6