Hi,
I have a matrix, with a lot of columns/features, I want to predict one of this column (outcome) using the other ones. i am very new in doing this kind of things, and I don't know if I can find exactly what I want, so sorry for my silly question in that case.
My intention is to find the features that contribute more to predict my variable. All of them are numeric, although I can transform to discrete values.
Any suggestion how to start, or what method to use. I am playing with WEKA, but since it integrates a lot of algorithms I don't know exactly what it means each of the parameters in the results.
Also I played with linear regression, but I don't know how to find the best model (have I to play with different number of combinations of all the features?) and neither the coefficients are a direct value to look at and to assume that this feature contribute more or less.
As an aside, scikit in python has a surprisingly good tutorial for this kind of thing: http://scikit-learn.org/stable/index.html
that's great, i use python normally for my scripts. so it is perfect if I decide the model first!