Skip to content

cluster summary score

2 messages · Huan Huang, Frank E Harrell Jr

#
Dear Prof. Harrell and R list,

I have done the variable clustering and summary scores. Thanks a lot for
your kind help.

But it hasn't solved the collinearity problem in my dataset. Afer the
clustering and transcan, there is still very strong collinearity between the
summary scores. The objective of my project is to find out the influential
variables. I believe any variable resuction is not appropriate when the
collinearity exists. I am thinking about the principal component regression
and variable reduction based on it (Rudolf J. Freund and William J. Wilson
(1998), P215).

Does anybody have suggestion on the variable resuction under this condition?
I will appreciate any kind imformation.

Best

Huan
----- Original Message -----
From: "Frank E Harrell Jr" <fharrell at virginia.edu>
To: "Huan Huang" <huang at stats.ox.ac.uk>
Sent: Sunday, August 04, 2002 7:56 PM
Subject: Re: cluster summary score
can I
3,5,7
with
(e.g.,
have
on
how
Design
the
this if
likelihood
ability,
outputs
Hmisc
cross-sectional
to
analoque to
vaguely
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._.
Statistics
Sciences
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._.
http://hesweb1.med.virginia.edu/biostat
-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !)  To: r-help-request at stat.math.ethz.ch
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
#
This is confusing because if you do the variable clustering correctly, the cluster scores should be weakly correlated.  Check how you are doing the variable clustering and how you are interpreting measures of collinearity.
-Frank Harrell

On Thu, 8 Aug 2002 13:23:05 +0100
Huan Huang <huang at stats.ox.ac.uk> wrote: