Skip to content
Prev 156055 / 398503 Next

Ward's Clustering Doubts

Hi Rodrigo,

[apropos of Ward's method]
Strictly speaking this is probably correct, as Ward's method does an
analysis of variance type of decomposition and so doesn't really make much
sense  (I think) unless Euclidean distance (i.e. least-squares) is used.

However, there may be ways around this. First, because a distance metric is
non-Euclidean does not mean that it is always non-Euclidean. You can test
this using ?is.euclid in package ade4. You can also test your matrix by
doing a principal co-ordinate analysis; then look for negative eigenvalues.
If none are found, the matrix is Euclidean and it should be OK to use Ward's
method on that data set.

Probably a better approach is to make your distance matrix Euclidean. There
are several functions in ade4 that will do this. The idea then is to
present/compare the two solutions: the first using the uncorrected,
non-Euclidean distance matrix, the second using the corrected version. You
could use procrustes/co-inertia analysis to compare the two in an
intermediate step.

Regards, Mark.
Rodrigo Aluizio wrote: