
random forest question

The classwt values are used in the Gini index when splitting nodes.  What we
found (about two years ago) is that this option does not affect the
predictions as much as one would expect.  I suspect the reason is that
the trees are grown to maximum size and not pruned back.  This is why I
implemented the cutoff and sampsize options in randomForest().  Do make use
of them.  The classwt option is there just for old times' sake, I guess...
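
For illustration, here is a minimal sketch of how cutoff and sampsize might be used on an imbalanced two-class problem.  The data are simulated and the particular values (40 cases per class, a 0.75/0.25 cutoff) are arbitrary assumptions, not recommendations.  It also assumes a reasonably recent version of the package in which the strata argument is available.

```r
library(randomForest)

set.seed(1)

# Hypothetical imbalanced data: 950 "neg" cases and 50 "pos" cases,
# with the "pos" class shifted by one unit in every predictor.
n_neg <- 950
n_pos <- 50
x <- rbind(matrix(rnorm(n_neg * 5), ncol = 5),
           matrix(rnorm(n_pos * 5, mean = 1), ncol = 5))
colnames(x) <- paste0("v", 1:5)
y <- factor(c(rep("neg", n_neg), rep("pos", n_pos)))

# Baseline fit with default settings.
rf_default <- randomForest(x, y, ntree = 200)

# sampsize: draw the same number of cases from each class for every tree.
# The order of sampsize follows levels(y), here "neg" then "pos".
rf_sampsize <- randomForest(x, y, ntree = 200,
                            strata = y, sampsize = c(40, 40))

# cutoff: the predicted class is the one with the largest ratio of
# vote proportion to cutoff, so a smaller cutoff for "pos" makes that
# class easier to predict.  Same order as levels(y).
rf_cutoff <- randomForest(x, y, ntree = 200, cutoff = c(0.75, 0.25))

# Compare the out-of-bag confusion matrices of the three fits.
print(rf_default$confusion)
print(rf_sampsize$confusion)
print(rf_cutoff$confusion)
```

Comparing the class.error column of the three confusion matrices shows how each option trades errors on one class against errors on the other.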

BTW, 4.0-7 is current, and fixes a few bugs in 4.0-1.

BTW #2:  The convention is to direct questions specific to a package to the
package maintainer (me in this case) first, before posting to R-help.

HTH,
Andy