To do two kind of variables to do differential expression analysis

0

Entering edit mode

6.3 years ago

1106518271 ▴ 60

A toy example:

health mouse: tissue a(replication 3), tissue b(replication 9).
disease mouse: tissue a(replication 5), tissue b(replication 6).

For health_a_1, health_a_2, health_a_3, health_b_1...each do RNA-seq, got their RPKM list

I hope to see tissue a has differences in expression between health and disease, tissue b has differences in expression between health and disease.

How to know there any differences in expression? I know one factor, but here two: health condition and tissue. I think here shouldn't study independent, use model y=ax1 + bx2 + cx1x2?

Some suggestion? Thanks!

RNA-Seq R next-gen • 1.2k views

ADD COMMENT • link updated 6.2 years ago by Biostar 20 • written 6.3 years ago by 1106518271 ▴ 60

2

Entering edit mode

First off, RPKM values are not suitable for sound statistics. They (all bioinformaticians) recommend to use raw read counts. When you have raw read counts you can continue for example in limma (voom or trend) to make designs like yours. Read the manual, it is pretty well written (also for beginners).

ADD REPLY • link 6.3 years ago by Benn 8.3k

0

Entering edit mode

Thanks! What's more, for id list (my row of matrix), recommend use mRNA id expresion or gene id expression in general?

ADD REPLY • link 6.3 years ago by 1106518271 ▴ 60

1

Entering edit mode

Try to use something like featureCounts, with gene annotation from ENSEMBL. Then you'll have ENSEMBL gene names, which you can convert later to anything you want.

ADD REPLY • link 6.3 years ago by Benn 8.3k

Login before adding your answer.