Question

Is it correct to pool transcriptomes coming from different experiment for DE analysis?

0

Entering edit mode

6.9 years ago

keepclam ▴ 10

Hello everybody,

I downloaded from SRA all available transcriptome experiments from a nonmodel organism in order to perform comprehensive analyses. I want to analyze both sequences and differential expression. While the former seems feasible with different experiments, the latter concerns me a bit.

Is it methodologically safe to compare differential expression levels across different experiments? All of them are performed on Illumina machines, but with different technology levels (2000 / 2500 / 4000) and with different coverage.What steps do I have to perform in order to compare their transcription level? Can I pool them and do some sort of normalization?

If you'd have some reading to suggest me (papers that do an analogous analysis) it would be excellent as well.

RNA-Seq differential-expression • 1.1k views

ADD COMMENT • link updated 6.9 years ago by WouterDeCoster 48k • written 6.9 years ago by keepclam ▴ 10

0

Entering edit mode

Q: Is it methodologically safe to compare differential expression levels across different experiments?

A: I did this exercise myself. In summary, I think it is possible, but it is usually non-trivial. The answer depends a lot on the kind of differential expression analysis that you are want to do. The level of differential expression of a gene usually depends on both the technical and experimental covariates. If the experimental covariates are actively driving the differential expression of a gene and the DE has huge effect size, it generally shows in both studies seamlessly, and sometimes the expression profiles from different studies cluster very naturally after a z-score. But often the case is that the expression profiles cluster by studies instead. And when that happens, there are Combat or some other batch correction tools that you can try using. If yeah, there are caveats associated with batch correction too......

ADD REPLY • link 6.9 years ago by btsui ▴ 300

score 2 · Answer 1 · 2018-08-27

Is it methodologically safe to compare differential expression levels across different experiments?

No. For transcriptomics, everything matters. The kit used for library prep, the kit used for RNA extraction, the technician performing the library prep, the temperature in the lab, probably what the technician had for lunch and the music in the lab.

You could do proceed with caution if you have something like the following setup:

Experiment A has 5 samples for condition 1 and 4 samples for condition 2
Experiment B has 3 samples for condition 1 and 5 samples for condition 2

Then you can have the "unknown experimental artefacts" as a batch effect between those experiments in your design formula.

If you have:

Experiment A has 5 samples for condition 1
Experiment B has 5 samples for condition 2

Then you can never compare condition 1 with condition 2 because it is confounded by the experimental artefacts.