Classification Analysis

2 messages · Huntsinger, Reid, Rishabh Gupta

#
This is a very general problem, and large areas of statistics, computer
science, and related fields are concerned with it. R provides many options;
you might find that tree-based approaches (recursive partitioning) suit your
needs, in which case rpart and the new randomForest package will be of
interest. Also see package e1071 and the VR packages for starters. There are
many other possibilities; Ripley's Pattern Recognition and Neural Networks,
for example, describes some of them.
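
As a rough illustration of one option from the VR bundle, the sketch below
fits a linear discriminant analysis with lda() from MASS. The built-in iris
data is only a stand-in for the MIDI feature table (an assumption made for
illustration): Species plays the part of the grouping variable and the four
measurements play the part of the extracted features. The train/test split
and the variable names are likewise hypothetical.

```r
library(MASS)  # part of the VR bundle; provides lda()

# Assumption: iris stands in for the real data set.
set.seed(1)
train_idx <- sample(nrow(iris), 100)
train <- iris[train_idx, ]
test  <- iris[-train_idx, ]

# Fit the discriminant rule; "Species ~ ." means "use every other column".
fit <- lda(Species ~ ., data = train)

# predict() returns a list; the $class component holds the predicted groups.
pred <- predict(fit, newdata = test)$class

# Cross-tabulate predicted against known groups on the held-out cases.
print(table(predicted = pred, actual = test$Species))
```

With several hundred features and few cases, a plain lda() fit may be
unstable, so some form of variable selection beforehand is probably worth
considering.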

Reid Huntsinger

-----Original Message-----
From: Rishabh Gupta [mailto:rg117 at ohm.york.ac.uk]
Sent: Tuesday, April 16, 2002 11:14 AM
To: r-help at stat.math.ethz.ch
Subject: [R] Classification Analysis


Hi everyone,
    Could somebody explain to me which package/function to use for
classification analysis? I am analysing music files in the form of MIDI
files. I end up with about 750 dependent variables from the analysis; I also
have a number of independent/grouping variables that I set manually. What I
would like is to be able to predict which group a particular MIDI file
belongs to, given the 750 dependent variables. In order to do this I have to
perform classification analysis on a sample set of MIDI files where I know
what group they belong to. I want to extract the 'classification rule' that
would enable me to predict the group of each MIDI file (there would be a
different classification rule for each grouping variable). Can anybody
explain the best way of doing this in R? What is the best package/function
for performing classification analysis?

Any help would be greatly appreciated.

Many Thanks For Your Help!

Rishabh

-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.
-.-
r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !)  To: r-help-request at stat.math.ethz.ch
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._.
_._


------------------------------------------------------------------------------
Notice:  This e-mail message, together with any attachments, contains information of Merck & Co., Inc. (Whitehouse Station, New Jersey, USA) that may be confidential, proprietary copyrighted and/or legally privileged, and is intended solely for the use of the individual or entity named in this message.  If you are not the intended recipient, and have received this message in error, please immediately return this by e-mail and then delete it.

==============================================================================

#
Thanks for your reply.
I am still learning these aspects of statistical analysis, so if I don't
make sense please forgive me.
All this work I am doing is part of a PhD, and my department does not really
like neural networks for situations like these because they say a network
acts like a black box and we don't really know what is happening inside. So
I would like to concentrate on purely statistical techniques. Ideally what
I would like is some kind of function that calculates the "classification
rule" when given a grouped data set:
    rule <- classification( GroupVar ~ DepVar1 + DepVar2 + DepVar3 + DepVar4
..... + ...... DepVarX )

Then I would be able to use that rule and apply it to a single data element
for which the group is not known:

    theGroup <- rule( DataElement )

I have looked at the rpart package in R but I am not entirely sure how to
use it in a way that lets me create the "classification rule" and then use
that rule. I understand that it is a general problem, but it's made more
difficult for me because I have to deal with 750 dependent variables.
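
The two-step workflow described above (build a rule, then apply it to a new
case) maps fairly directly onto rpart() and predict(). The following is a
minimal sketch, not a definitive recipe: the built-in iris data replaces the
MIDI table, and the names train, new_case and the_group are hypothetical.

```r
library(rpart)

# Assumption: iris stands in for the grouped MIDI data set.
train <- iris

# Step 1: build the "classification rule" from cases whose group is known.
# The "." on the right-hand side avoids typing out every predictor by name.
rule <- rpart(Species ~ ., data = train, method = "class")

# Printing the fitted tree lists its splits, so the rule can be inspected.
print(rule)

# Step 2: apply the rule to one case whose group is not known.
# The new case must be a data frame using the same predictor column names.
new_case <- data.frame(Sepal.Length = 5.0, Sepal.Width = 3.4,
                       Petal.Length = 1.5, Petal.Width = 0.2)
the_group <- predict(rule, newdata = new_case, type = "class")
print(the_group)
```

For several grouping variables, one such fit per grouping variable would
give one rule each. The formula shorthand "GroupVar ~ ." also means the 750
columns never have to be written out, provided the data frame holds only
that grouping variable and the predictors.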

Your help is greatly appreciated.

Many Thanks

Rishabh

----- Original Message -----
From: "Huntsinger, Reid" <reid_huntsinger at merck.com>
To: "'Rishabh Gupta'" <rg117 at ohm.york.ac.uk>; <r-help at stat.math.ethz.ch>
Sent: Tuesday, April 16, 2002 5:00 PM
Subject: RE: [R] Classification Analysis