Question

LIMMA analysis with 2 samples in comparison group

1

Entering edit mode

6.2 years ago

Sebastian Hesse ▴ 350

In my differential expression analysis dataset (comparing promotes of healthy cells vs diseased cells of patients with defined genotypes) I have a few genotypes with only 2 samples instead of the classically used 3.

To give it a try I included them in the makeContrast for LIMMA and it runs without complain. Also, if I check for proteins I would expect to be lower expressed like the ones mutated in the patients, they are significant in those groups (and even specifically in those groups only).

So my question is if this approach is ok and valid or a total NO GO that won't be accepted by any reviewer.

Thanks for your comments! Sebastian

limma r proteomics • 3.1k views

ADD COMMENT • link 6.2 years ago by Sebastian Hesse ▴ 350

1

Entering edit mode

Thanks for your comments, unfortunately I am working on a very rare disease (congenital neutropenia) and its feels already quite an accomplishment to have 8 of the disease genotypes in my cohort. So unfortunately I won't get more and there are no other datasets as we are the first performing this kind of analysis - but thanks a lot for your comments. I will go on with the study and validate the proteome findings on genetic level. But good to hear that you actually consider it ok (with reservations) and don't reject it straight out :)

ADD REPLY • link 6.2 years ago by Sebastian Hesse ▴ 350

0

Entering edit mode

Especially in your case, with rare diseases, it would be acceptable to use n=2 I think. Good luck.

ADD REPLY • link 6.2 years ago by Benn 8.4k

0

Entering edit mode

For a given gene, the within-group variance is assumed constant across the groups. And you've a range of different groups, some (? most) with >= 3 samples, so this isn't a classical n=2 experimental-design. Even within a given n=2 versus n=2 contrast, the study's a bit better than it would be if you only had the samples for those two groups. As a result, you can get pretty good estimates in the expeirment as described and it should be acceptable.

ADD REPLY • link 6.2 years ago by russhh 5.8k

score 1 · Answer 1 · 2019-01-18

1

Entering edit mode

6.2 years ago

Benn 8.4k

n=2 experiments are as low as you can go. I mean some even do n=1 experiments with edgeR, and some might even convince reviewers/editors to publish that, but if I have to review n=1 experiments (proteomics or RNA-seq) I would reject the paper for that fact. n=2 is acceptable, but of course always better to include more.

ADD COMMENT • link 6.2 years ago by Benn 8.4k

0

Entering edit mode

I agree with b.nota's comments. It is also difficult to say if your work using n=2 would be accepted or not. There are 1000s of journals and I feel that it can be 'hit and miss' if you get a 'generous' editor and peer reviewers. Of course, science should be transparent and one should have the same experience at all journals, but this is not the case at all.

Could you not at least attempt to validate the finding in an online, already-published dataset?

ADD REPLY • link 6.2 years ago by Kevin Blighe 89k