statistical test for comparison of two classifications (nominal)

Another useful measure for comparing partitions is the adjusted Rand
index, which is implemented in the classAgreement function of the
e1071 package.
If the partitions to be compared are stored in a matrix
(where each column is a different partition), the syntax is
ARI <- classAgreement(table(data[,i], data[,j]))$crand
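
To make clear what that value measures, here is a minimal base-R sketch of the
adjusted Rand index computed from the same contingency table. The function
name adjusted_rand and the toy matrix are made up for illustration; they are
not part of e1071. With e1071 installed, classAgreement(...)$crand should give
the same number.

```r
# Adjusted Rand index for two label vectors, using only base R.
# (adjusted_rand is a hypothetical helper, not an e1071 function.)
adjusted_rand <- function(x, y) {
  tab <- table(x, y)                        # contingency table of the two partitions
  n <- sum(tab)
  sum_cells <- sum(choose(tab, 2))          # pairs grouped together in both partitions
  sum_rows  <- sum(choose(rowSums(tab), 2)) # pairs grouped together in x
  sum_cols  <- sum(choose(colSums(tab), 2)) # pairs grouped together in y
  expected  <- sum_rows * sum_cols / choose(n, 2)  # value expected by chance
  max_index <- (sum_rows + sum_cols) / 2
  (sum_cells - expected) / (max_index - expected)
}

# Toy data (assumed): each column is one partition of the same six objects.
data <- cbind(p1 = c(1, 1, 1, 2, 2, 2),
              p2 = c(2, 2, 2, 1, 1, 1),   # same grouping as p1, labels swapped
              p3 = c(1, 1, 2, 2, 3, 3))

ari_12 <- adjusted_rand(data[, 1], data[, 2])
ari_13 <- adjusted_rand(data[, 1], data[, 3])
```

Since p1 and p2 describe the same grouping under different labels, ari_12 is 1:
the index does not depend on how the classes are named. Note that the result
is undefined (NaN) if both partitions put every object into a single class.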

Other useful measures of clustering quality are the
silhouette index, the C-index and the Goodman-Kruskal index, although
these generally evaluate the inter/intra-cluster distance distributions
of a single partition rather than the agreement between two partitions.
For instance, you can maximise (or minimise, depending on the index)
these indices to find the best partition among a set of candidates.
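
As a rough sketch of that selection step, the code below scores a few candidate
partitions by their average silhouette width, using silhouette() from the
cluster package, and keeps the best one. The iris data, the average-linkage
hierarchical clustering and the range of k are assumptions chosen only to
produce some candidates; any set of label vectors over the same objects would
do.

```r
library(cluster)  # provides silhouette()

# Assumed example data: the four numeric columns of iris, standardised.
x <- scale(iris[, 1:4])
d <- dist(x)

# Candidate partitions: cut one hierarchical tree at several values of k.
tree <- hclust(d, method = "average")
ks <- 2:6

# Average silhouette width of each candidate (higher is better).
avg_sil <- sapply(ks, function(k) {
  labels <- cutree(tree, k = k)
  mean(silhouette(labels, d)[, "sil_width"])
})

best_k <- ks[which.max(avg_sil)]
best_partition <- cutree(tree, k = best_k)
```

The same loop works for an index that should be minimised: swap the scoring
function and replace which.max with which.min.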

Mattia Prosperi.


2010/11/17 Marc Schwartz <marc_schwartz at me.com>: