Rna-Seq And Microarray Classification
1
2
Entering edit mode
11.1 years ago

Hi All,

I am interested in developing a prognostic classifier in the following way:

  1. Use a large RNA-Seq cohort with SVM and LOOCV with RFE to develop a prognostic classifier that is able to distinguish between two phenotypes.
  2. I want take this classifier and test it on a different Microarray cohort to see how well these genes can discriminate between the same two phenotypes as described within the RNAseq data.

Since the distributions of these two data-types vary, we simply variance stabilize transform the counts which makes it more normal instead of neg. binomial.

The first point above is straightforward. My question is within the second point above. How can one take a classifier that was already independently created and validated to distinguish between two phenotypes using one cohort (RNAseq) but then test the accuracy of that multi-gene classifier to distinguish between the same two phenotypes using another cohort (microarray)?

Yours,

M.B.

rnaseq microarray classification • 3.8k views
ADD COMMENT
0
Entering edit mode

This is probably not a solution to your problem, but have a look at this paper, where they also have to address the multiplatform issue. Maybe that'll at least provide a springboard for a solution.

ADD REPLY
0
Entering edit mode

Classification with binary gene-expression was an idea. I see it was put out there in 2009 but I was completely unaware of this paper. Cheers, this may provide useful for us.

ADD REPLY
0
Entering edit mode

If you switch classification methods to work with a score for relative expression between two groups, this might also help.

For example, I was able to apply a classifier from a microarray dataset to an RNA-Seq dataset using BD-Func:

https://peerj.com/articles/159/

ADD REPLY
0
Entering edit mode

Cheers guys. I'll meditate a bit on these approaches and test them out. I still think it is quite surprising though that still you can not take a classifier from one study and put it on to another, using expression, to see how it performs.

ADD REPLY
2
Entering edit mode
11.1 years ago

I think the answer is that this is unfortunately very hard to do.

The classifier must have the same sort of input for training and validation. You can try quantile normalizing the RNA-Seq and microarray data, but I have found this typically doesn't work very well (however, it is still the best suggestion that I can come up with).

ADD COMMENT
0
Entering edit mode

Thanks. Although, I wouldn't want to normalize the both them. It's my impression that in order to make the two distributions somewhat similar, RNASeq needs to be variance stabilise transformed and a covariate shift needs to be used in order to make the two distributions more similar. Although these methods are being working out.

ADD REPLY
0
Entering edit mode

Hi Michael, I'm actually addressing the exact same question, going from training on RNA-Seq to testing on micro-array. I confirm that this look like an open question. I've had some success by applying voom on the RNA-Seq data (from package limma, perform a log-transformation of the data by accounting for different library sizes). Did you tried this also?

ADD REPLY

Login before adding your answer.

Traffic: 1003 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6