Hi everyone, I'm trying to perform an SVM-RFE feature selection using the caret
package on R: I have a dataset with 1000 and more features (miRNA expression counts, normalized) as columns (+ one column with the class, normal vs tumor) and few hundreds of samples as rows.
I've found that the rfe
function can be used for this purpose, but I'm not sure how to set the arguments.
In particular I cannot understand what does the sizes
argument mean: just some features will be used for the model? does it make sense when I need to use RFE for selecting them?
And what does happen if I do not set it? I didn't understand this point.
I tried to set the rfeControl
first, then, I set the rfe
, but I don't know if this is the correct way to do it
control <- rfeControl(functions = caretFuncs, #It is correct for svm-rfe?
method = "repeatedcv",
number = 10,
verbose = FALSE)
recursivefs <- rfe(dataset[,-1], dataset$class,
method="svmLinear",
rfeControl = control)
I set the argument method="svmLinear" as I found that it determs the model used, but I'm not sure about this; moreover, I don't know what to set on "sizes". Can anyone help? I'm at the very beginning with feature selection, so every advice will be very appreciated. Thank you.