Question about randomForest

Let me make this as clear as I possibly can:  If predict() is called without newdata, all it can do is assume that prediction on the training set is desired.  In that case it returns the OOB (out-of-bag) predictions.  If newdata is given, predict() assumes the data are "new" and thus makes predictions using all trees.  If you just feed the training data as newdata, then yes, you will get overfitted predictions, because every tree votes, including the trees that were grown on those same rows.  It almost never makes sense (to me anyway) to make predictions on the training set.
Possibly, depending on how "close" the new data are to the training set.  This applies to nearly _ALL_ methods, not just RF.
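
The difference between the two predict() calls can be seen with a small sketch.  The iris data, the seed and ntree = 500 are my own choices for illustration, not anything from the original question, and the exact error rates will vary from run to run:

```r
library(randomForest)

set.seed(42)  # arbitrary seed, only for repeatability
rf <- randomForest(Species ~ ., data = iris, ntree = 500)

# No newdata: returns the OOB predictions, i.e. each training row is
# predicted only by the trees that did not have it in their bootstrap sample.
oob_pred <- predict(rf)

# Training data passed as newdata: treated as "new" data, so every tree
# votes, including the trees that were grown on that row.
resub_pred <- predict(rf, newdata = iris)

oob_err   <- mean(oob_pred   != iris$Species)
resub_err <- mean(resub_pred != iris$Species)

cat("OOB error:            ", oob_err,   "\n")
cat("Resubstitution error: ", resub_err, "\n")
```

The second error rate should come out noticeably lower than the first (often zero on a small data set like this), which is the overfitting described above.  The OOB figure is the one to trust as an estimate of performance on genuinely new data.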

Andy