Question

Finding Differentially Expressed Genes Across Multiple Cell Types In A Datset

2

Entering edit mode

11.6 years ago

Bioinfo_2006 ▴ 160

I have a transcriptome dataset from affymetrix microarray and has multiple cell types which were probed. I would like to obtain the differentially expressed genes from each of the cell types (i.e significant transcripts) using packages in R. My data is in the form of a matrix with genes and samples as rows and columns respectively. I tried using the csSAM package but did not quite know how to do it. Is there any other way or package to do this ?

r differential-expression • 6.3k views

ADD COMMENT • link updated 11.6 years ago by Sean Davis 27k • written 11.6 years ago by Bioinfo_2006 ▴ 160

0

Entering edit mode

Nobody knows "quite know how to do it" the first time :) With what, precisely, are you having trouble? Have you read the documentation? Is there an example or tutorial that you are trying to follow? Are you seeing an error message that you don't understand? It's easier to help if your question is better defined.

There are lots of R packages for this task - limma is popular and well-documented - but in all cases you need to make some effort to figure out how they work.

ADD REPLY • link 11.6 years ago by Neilfws 49k

0

Entering edit mode

True. I should have given a clearer example.

My dataset looks this way:

Probes  celltype1  celltype2 celltype3 celltype4

gene1   5.098      4.677      7.456      8.564  
gene2   8.906      6.653      6.754      7.546
gene3   7.409      5.432      6.724      5.345
gene4   4.876      5.981      4.290      4.267
gene4   3.567      3.425      8.564      6.345 
gene5   2.569      8.645      5.234      7.345

The dimension is 22810(genes) x 20(cell types).

I tried to use the csSAM package in R.

csSamWrapper(G, cc, y, nperms = 200,alternative = "two.sided", standardize = TRUE,
medianCenter = TRUE, logRm = FALSE, logBase = 2,nonNeg = TRUE, fileName = "csSAMout.pdf")

where G is my expression matrix where I have the genes as rows and cell types as columns. and cc stands for the Matrix of cell-frequency. (n by k, n samples, k cell-types).

The example from the package had several samples.

I am not clear as how to construct the cell frequency matrix because I have only one sample.

ADD REPLY • link updated 11.6 years ago by Neilfws 49k • written 11.6 years ago by Bioinfo_2006 ▴ 160

0

Entering edit mode

With only one sample per cell type, you are in uncharted territory. What types of arrays were used to generate these data?

ADD REPLY • link 11.6 years ago by Sean Davis 27k

0

Entering edit mode

Affymetrix arrays ( Arabidopsis ATH1 chip)

ADD REPLY • link 11.6 years ago by Bioinfo_2006 ▴ 160

0

Entering edit mode

Are you sure the data you have shown is one sample per celltype i,e, only one column for each cell type or is it some sought of average you are using??

ADD REPLY • link 11.6 years ago by k.nirmalraman ★ 1.1k

0

Entering edit mode

Yes, it is after the average of three replicates

ADD REPLY • link 11.6 years ago by Bioinfo_2006 ▴ 160

2

Entering edit mode

You'll definitely want to get the data in the un-averaged form. With the averaged replicates, you will be unable to apply statistical methods to determine differential expression (since you have no replicates). After you have the data with three replicates per cell type, I will refer to my answer below for multi-group comparison using limma.

ADD REPLY • link 11.6 years ago by Sean Davis 27k

0

Entering edit mode

Thank you. Yes, I am trying out different contrasts that I could apply to get what I want. I think Limma would be the best option to do this.

ADD REPLY • link 11.6 years ago by Bioinfo_2006 ▴ 160

0

Entering edit mode

So if I try this particular contrast as follows: I have cell types A, B ,C,D I'd use limma and define a contrast like A - (B+C+D)/3", contrast.matrix<-makeContrasts("p1-(p2+p3+p4)/3",levels=design) I tried this and got some DEG's. So will that be those ones only in A and not in others ?

ADD REPLY • link 11.6 years ago by Bioinfo_2006 ▴ 160

score 2 · Answer 1 · 2013-04-17

2

Entering edit mode

11.6 years ago

Sean Davis 27k

Check out section 8.3 in the limma user guide here:

http://www.bioconductor.org/packages/release/bioc/vignettes/limma/inst/doc/usersguide.pdf

You'll probably want (need) to read the user guide and help pages associated with limma functions, also.

ADD COMMENT • link 11.6 years ago by Sean Davis 27k

score 1 · Answer 2 · 2013-04-17

1

Entering edit mode

11.6 years ago

k.nirmalraman ★ 1.1k

There is a little bit of discussion Rna Seq Cell Type Specificity that concerns this question though raised for RNA Seq.... I am sure these methods can be used for microarrays as well

ADD COMMENT • link 11.6 years ago by k.nirmalraman ★ 1.1k