David For Rnaseq Data
3
2
Entering edit mode
12.9 years ago
Marina Manrique ★ 1.3k

Hi there,

I'm thinking about using DAVID for GO-term enrichment analysis in a set of DE genes from RNAseq. The thing is all the references I've found so far about using DAVID are for microarray data and some of them for CHIPseq data,

Besides, in the paper describing GOseq, they point out that

standard methods give biased results on RNA-seq data due to over-detection of differential expression for long and highly expressed transcripts.

so it makes me wonder if DAVID is really appropriate, have you used it for RNAseq?

Any feedback and reference is welcome :)

Thanks!

gene rna • 8.9k views
ADD COMMENT
4
Entering edit mode
12.9 years ago

David is not really appropriate for count-based data like RNA-seq for the reason that is given in the GOseq paper.

ADD COMMENT
2
Entering edit mode

I was going to cite this paper for further exploration, but upon a quick re-skim, it seems they (sadly) didn't do any analysis on the tag count vs. ability to call differential expression. Still useful to peruse, though ...

ADD REPLY
1
Entering edit mode

Some navel gazing in hopes of being a bit more thorough: if I recall correctly, the problem is the bias in RNA-seq to call differential expression and its relations tip transcript length. If you are using an "Tag-sequencing" method for gene expression analysis which doesn't have this bias (ie. SAGEseq, deepCAGE, or similar), "normal" GO analysis downstream of appropriate differential expression calls should suffice, no?

ADD REPLY
1
Entering edit mode

This one might be closer? http://genomebiology.com/2010/11/10/R106. More analytically, think of the counts as poisson-distributed (they are not, but the approximation is not too far off for low-counts), so the mean and variance are equal. As the number of counts increases, the distribution becomes tighter. I'm no statistician, but hopefully the point is coming across.

ADD REPLY
0
Entering edit mode

Highly-expressed genes are more likely to be called differentially-expressed than low-expressed genes, also. This effect is independent of transcript length bias, so you are still not on firm ground with SAGE or CAGE. I do not know how biased results will be in practice, though.

ADD REPLY
0
Entering edit mode

Thanks a lot for the comments!

ADD REPLY
0
Entering edit mode
9.6 years ago

Hi, I was thinking of using DAVID software with RNAseq data BUT only for selecting the common DEGs that appear in different RNAseq experiments with different samples.

I mean, I have RNAseq data from the comparison of two samples, and another RNAseq data from the comparison of two different samples. And I want to know the common DEGs of both RNAseq experiments, without taking into account their functional characteristics. Do you know if DAVID will solve my problem? Or if there is another software to do that?

I only want to know which genes are differentially expressed commonly in all the comparisons.

ADD COMMENT
0
Entering edit mode
9.6 years ago
andrew ▴ 560

David has not updated its databases since 2009. You might consider iPathwayGuide. It's completely free to try. You can access it from www.iPathwayGuide.com

ADD COMMENT

Login before adding your answer.

Traffic: 1790 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6