Skip to content

randomForest Tutorial

3 messages · Dirk Nemitz, Jim Porzak, Hans W Borchers

#
Dirk,

Check out Andy Liaw/Mat Wiener's article in RNews 2002-3
www.r-project.org/doc/Rnews/Rnews_2002-3.pdf

Also my decks at www.porzak.com/JimArchive :
1. http://www.porzak.com/JimArchive/JimPorzak_CIwithR_useR2006_tutorial.pdf
2nd part is on randomForest
2. http://www.porzak.com/JimArchive/JimPorzak_RFwithR_DMAAC_Jan07_webinar.pdf
is all on randomForest

The latter two focus on the use of randomForest in marketing & on the
binary response problem

For the variable selection problem you mention, check out Ramon
Diaz-Uriate's package varSelRF on your local CRAN.

HTH,
Jim Porzak
Responsys, Inc.
San Francisco, CA
http://www.linkedin.com/in/jimporzak
On Tue, Jul 22, 2008 at 4:58 AM, Dirk Nemitz <dirk.nemitz at gmail.com> wrote:
#
The help pages for 'randomForest' are quite clear in stating how to use the
'randomForest()' function. Input is a data frame (not a csv-file) and a
formula involving attribute names.

There is a kind of tutorial for classification and clustering with Random
Forests on Leo Breiman's web page
<http://www.stat.berkeley.edu/users/breiman/RandomForests/cc_home.htm>.  It
describes the older code on which the R package is built, not the
implementation of RandomForest from Salford systems, so more or less what
you want.

And there is another implementation of random forests in the 'party'
package, see the 'cforest()' function. (The package varSelRF has already
been mentioned in a previous post.)

Hans Werner Borchers
ABB Corporate Research
Dirk Nemitz wrote: