randomForests predict problem

2 messages · Liaw, Andy, Torsten Hothorn

Yves,

I will add checks for NAs in predict.randomForest().
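
A check of this kind could look like the minimal sketch below. The helper name check_no_na is hypothetical and is not part of the randomForest package; it only illustrates failing early with a message that names the columns containing NAs, and it assumes the new data arrive as a data frame.

```r
# Hypothetical helper (not from the randomForest package): stop before
# prediction if any column of the new data contains missing values.
check_no_na <- function(newdata) {
  # One TRUE/FALSE per column: does this column contain at least one NA?
  has_na <- vapply(newdata, function(col) any(is.na(col)), logical(1))
  na_cols <- names(newdata)[has_na]
  if (length(na_cols) > 0) {
    stop("missing values in newdata column(s): ",
         paste(na_cols, collapse = ", "))
  }
  # Return the data unchanged (invisibly) so the call can be chained.
  invisible(newdata)
}
```

Calling check_no_na() on the test data before predict() would then stop with an informative error rather than handing NAs to the forest.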

In the next version of randomForest (currently called 3.9-x), there will be
facilities for handling NAs in the training set.  However, there's no way to
handle NAs in the test set yet.  I believe Leo is still working on that.

In Leo's v.4 of the Fortran code, he uses the proximities from the random
forest to iteratively impute NAs, starting from the column median (for
numeric variables) or mode (for categorical variables).  I've implemented
this scheme at the R level, so that it works for both regression and
classification.
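
The scheme described above can be sketched in base R roughly as follows. This is an illustration under stated assumptions, not the code in the package: the function names rough_fill and prox_update are hypothetical, and the update rule (proximity-weighted average for numeric variables, proximity-weighted vote for factors) is an assumption about how the proximities are used. The proximity matrix prox would come from a forest grown on the filled-in data, for example from randomForest(..., proximity = TRUE).

```r
# Starting point: fill NAs with the column median (numeric columns)
# or the most frequent level (factor columns).
rough_fill <- function(x) {
  for (j in seq_along(x)) {
    miss <- is.na(x[[j]])
    if (!any(miss)) next
    if (is.numeric(x[[j]])) {
      x[[j]][miss] <- median(x[[j]], na.rm = TRUE)
    } else {
      counts <- table(x[[j]])          # table() ignores NAs by default
      x[[j]][miss] <- names(counts)[which.max(counts)]
    }
  }
  x
}

# One refinement pass (assumed update rule): re-estimate each originally
# missing cell from the cases where that variable was observed, weighting
# them by their proximity to the case being imputed.
#   filled : data frame returned by rough_fill() or a previous pass
#   was_na : logical matrix, is.na() of the original data
#   prox   : n x n proximity matrix from a forest grown on `filled`
prox_update <- function(filled, was_na, prox) {
  for (j in seq_along(filled)) {
    obs <- !was_na[, j]
    for (i in which(was_na[, j])) {
      w <- prox[i, obs]
      if (sum(w) == 0) next            # no similar observed case: keep fill
      if (is.numeric(filled[[j]])) {
        filled[[j]][i] <- sum(w * filled[[j]][obs]) / sum(w)
      } else {
        votes <- tapply(w, as.character(filled[[j]][obs]), sum)
        filled[[j]][i] <- names(votes)[which.max(votes)]
      }
    }
  }
  filled
}
```

In use, one would call rough_fill() once, then alternate between growing a forest on the filled data and calling prox_update() with its proximity matrix for a few iterations.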

There are a couple of things in Leo's new code that I have not added to the
package, and that's why the version is 3.9 rather than 4.0.  If you would
like to test the new code, please let me know.

Cheers,
Andy
------------------------------------------------------------------------------
On Wed, 2 Apr 2003, Liaw, Andy wrote:

yes, sure!

best,

Torsten