
cluster-analysis and NA's

Hi,
On Wed, 7 Aug 2002 chr.schulz at email.de wrote:

            
The choice of a clustering method depends very much on the aims of your
analysis and especially on your concept of "similarity" between the
entities to cluster. One of the most often used methods for clustering
variables is to compute a dissimilarity matrix between the variables, where
dissimilarity is defined as 1-correlation or 1-|correlation|. With
1-correlation, variables pointing in almost exactly opposite directions
are treated as extremely dissimilar; with 1-|correlation| they are
treated as similar.
Then you can apply one of the agglomeration methods of hclust, or of agnes
in the cluster package (the latter is not a bad choice as a default).
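NAs are not a big problem here, because correlations can still be computed
in the presence of NAs, for example from the pairwise complete observations
of each pair of variables (use = "pairwise.complete.obs" in cor).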
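
As a rough illustration of the recipe above, here is a minimal R sketch.
The simulated data, the variable names (x1, x2, x3, y1, y2), the average
linkage and the cut into two groups are all assumptions made for the
example, not anything taken from the original question.

```r
# Hypothetical example data: 50 observations of 5 made-up variables.
set.seed(1)
n <- 50
a <- rnorm(n)
b <- rnorm(n)
dat <- data.frame(
  x1 = a + rnorm(n, sd = 0.3),
  x2 = a + rnorm(n, sd = 0.3),
  x3 = -a + rnorm(n, sd = 0.3),  # points in the opposite direction to x1, x2
  y1 = b + rnorm(n, sd = 0.3),
  y2 = b + rnorm(n, sd = 0.3)
)

# Scatter a few missing values over the data (assumed pattern).
dat$x1[c(3, 17)] <- NA
dat$y2[c(8, 17, 40)] <- NA

# Correlations from pairwise complete observations: each pair of
# variables uses every row in which both of them are observed.
r <- cor(dat, use = "pairwise.complete.obs")

# Two dissimilarities between the variables.
d_signed <- as.dist(1 - r)       # opposite direction = extremely dissimilar
d_abs    <- as.dist(1 - abs(r))  # opposite direction = similar

# Hierarchical clustering of the variables (average linkage is an
# arbitrary choice for this sketch).
hc_signed <- hclust(d_signed, method = "average")
hc_abs    <- hclust(d_abs, method = "average")

# Cut each tree into two groups of variables.
groups_signed <- cutree(hc_signed, k = 2)
groups_abs    <- cutree(hc_abs, k = 2)

# agnes from the cluster package accepts the same dissimilarity object.
if (requireNamespace("cluster", quietly = TRUE)) {
  ag <- cluster::agnes(d_abs, diss = TRUE, method = "average")
}
```

With data simulated like this, x3 should be grouped with x1 and x2 under
1-|correlation| and kept apart from them under 1-correlation;
plot(hc_abs) shows the tree. One caveat: if two variables are never
observed together, their pairwise correlation is itself NA, and the
dissimilarity matrix can then not be passed to hclust as it stands.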

Christian