Hi,
I'm giving a presentation on neural networks and tree-based models in two
weeks, and at the moment I'm looking for a data set to use in it. I would
like to use a classic data set, like the iris data, that every
statistician already knows.
MASS4 uses the cpus data in Chapter 8.10 and the Cushing's syndrome data
in Chapter 12.4. Together with the iris data I mentioned, that makes
three candidate data sets. Does anyone have a recommendation as to which
of them would work best?
While I'm at it: is it technically correct to obtain the residual sum of
squares (using residuals()) from the nnet() and rpart() models, and then
say that one model is better than the other based on that statistic?
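
To make the question concrete, here is a rough sketch of the comparison I
have in mind, using the cpus data from MASS. The log10 transform of perf,
the scaling of the predictors, and the nnet() settings (size, decay, maxit)
are arbitrary choices for illustration only, and both sums are computed on
the same data the models were fitted to.

```r
library(MASS)   # cpus data
library(nnet)
library(rpart)

set.seed(1)     # nnet() starts from random weights

# Keep the six predictors and the response; model perf on the log10 scale.
dat <- transform(cpus[, 2:8], perf = log10(perf))
# Standardise the predictors (an assumption, to help nnet() converge).
dat[, 1:6] <- scale(dat[, 1:6])

# Regression tree with default settings.
fit.rp <- rpart(perf ~ ., data = dat)

# Single-hidden-layer network with a linear output unit for regression.
fit.nn <- nnet(perf ~ ., data = dat, size = 3, linout = TRUE,
               decay = 0.01, maxit = 500, trace = FALSE)

# In-sample residual sum of squares for each model.
rss.rp <- sum(residuals(fit.rp)^2)
rss.nn <- sum(residuals(fit.nn)^2)

cat("rpart RSS:", rss.rp, "\n")
cat("nnet  RSS:", rss.nn, "\n")
```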
--
Cheers,
Kevin