Question

Constructing a mean ROC curve based on 5 iterations

0

Entering edit mode

7.0 years ago

Swimming bird ▴ 20

I did a 5-fold cross-validation for testing a binary classifier. Using the results, I constructed a ROC curve for each iteration with ROCR package:

Is there any way to construct a mean ROC curve based on these 5 curves?

roc R rocr • 2.9k views

ADD COMMENT • link updated 7.0 years ago by Collin ▴ 1000 • written 7.0 years ago by Swimming bird ▴ 20

0

Entering edit mode

This seems off-topic for biostars? Perhaps more appropriate to ask on stats.stackexchange.com

ADD REPLY • link 7.0 years ago by Len Trigg ★ 1.7k

score 2 · Answer 1 · 2018-06-28

2

Entering edit mode

7.0 years ago

Collin ▴ 1000

What happens in 5-fold cross-validation is that you train on 4 of the folds and predict on the hold-out fold. This procedure is repeated until all folds have hold-out predictions on them. Given this preamble, you could just concatenate the scores and labels from the 5 hold-out predictions (which should comprise all of your data points), and then construct a single ROC curve based on that.

Ideally with enough data and randomization of which samples fall into each fold, there should not be large discrepancies between the ROC curve of each fold individually.

ADD COMMENT • link 7.0 years ago by Collin ▴ 1000

0

Entering edit mode

What you mean is to join the results of each interation in a single vector, right? The problem is that I am using a program that doesn't give a score of each datapoint, it only sorts the data points. Therefore, I can't join the results maintaining the rank...

ADD REPLY • link 7.0 years ago by Swimming bird ▴ 20

1

Entering edit mode

Yes, I meant to concatenate them into a single vector. Nearly all methods provide a score which is used for ranking. It's odd that it's not provided as output. They may have an internal score that is just not getting returned to the user. You might need to contact the authors or see if you can modify their code to return the score as an output.

ADD REPLY • link 7.0 years ago by Collin ▴ 1000

0

Entering edit mode

Yes, it's internally calculated but I can't change the code. I'll try to talk with the authors.

ADD REPLY • link 7.0 years ago by Swimming bird ▴ 20