Pre-model Variable Reduction

Thank you everyone.
What I really want is to select a subset of the variables themselves
from a superset of all variables, for example:
x1 - numeric, continuous
x2 - numeric, continuous
x3 - numeric factor with 2 levels
x4 - character factor with 10 levels
x5 - numeric, continuous
x6 - numeric, integer

A variable reduction method should then ideally give me something like

keep : x1, x3 and x6
drop : x2, x4 and x5

The 'redun' function from the Hmisc package seems promising, since it
handles categorical variables as well. A variable is dropped when it
can be predicted well from the other variables. I guess it is meant
to check for multicollinearity.
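
For illustration, here is a minimal sketch of how 'redun' could be
called on data shaped like the list above. The data are simulated and
are my own assumption, as is making x2 nearly a linear function of x1
so that there is something redundant to find. The r2 threshold of 0.9
is the function's documented default, not a recommendation. Note that
'redun' does not look at a response variable, so it will not
necessarily reproduce the keep/drop lists above.

```r
# Sketch only: simulated data mirroring the variable types listed above.
library(Hmisc)

set.seed(1)
n  <- 500
x1 <- rnorm(n)                                          # numeric, continuous
x2 <- 2 * x1 + rnorm(n, sd = 0.1)                       # assumption: almost determined by x1
x3 <- factor(sample(c(0, 1), n, replace = TRUE))        # factor with 2 levels
x4 <- factor(sample(letters[1:10], n, replace = TRUE))  # factor with 10 levels
x5 <- rnorm(n)                                          # numeric, continuous
x6 <- sample(1:100, n, replace = TRUE)                  # numeric, integer
d  <- data.frame(x1, x2, x3, x4, x5, x6)

# Flag a variable as redundant when the remaining variables predict it
# with R^2 of at least r2.
res <- redun(~ x1 + x2 + x3 + x4 + x5 + x6, data = d, r2 = 0.9)

print(res)   # summary of the redundancy analysis
res$In       # names of the variables retained
res$Out      # names of the variables flagged as redundant
```

The 'In' and 'Out' components of the result can then be used to subset
the data frame before modelling, e.g. d[, res$In].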

The RWeka package, as I mentioned earlier, allows one to use Weka's
variable reduction/selection techniques in R. I did come across an
implementation of the 'Genetic Search' method, but I have not been
able to find the documentation needed to adapt it to my needs.

Thank you all for your time.

Harsh Singhal
Decision Systems,
Mu Sigma Inc.
On Tue, Dec 9, 2008 at 8:05 PM, Ravi Varadhan <RVaradhan at jhmi.edu> wrote: