Skip to content

PAM Clustering Ignores Cluster Number Parameter

2 messages · Dario Strbenac, Martin Maechler

#
I am using PAM with k = 10 clusters, but I only get one cluster ID for all my observations. I couldn't find any discussion about this in the help file, or mailing lists.

Is there a reasonable explanation for this result ?

cIDs <- pam(all, 10, cluster.only = TRUE, do.swap = FALSE)
cIDs
    0 
16671

The matrix of observations can be found at : http://129.94.136.7/file_dump/dario/all.obj

I'm using R version 2.13.0 (2011-04-13) on Platform: x86_64-unknown-linux-gnu (64-bit) and have cluster_1.13.3.

--------------------------------------
Dario Strbenac
Research Assistant
Cancer Epigenetics
Garvan Institute of Medical Research
Darlinghurst NSW 2010
Australia
6 days later
#
> I am using PAM with k = 10 clusters, but I only get one cluster
    > ID for all my observations. I couldn't find any discussion about
    > this in the help file, or mailing lists.  Is there a reasonable
    > explanation for this result ?

    > cIDs <- pam(all, 10, cluster.only = TRUE, do.swap = FALSE)
    >> table(cIDs)
    > cIDs
    > 0 
    > 16671

    > The matrix of observations can be found at :
    > http://129.94.136.7/file_dump/dario/all.obj

For the mailing list archives:

Dario's data contained so many NA's that some of the computed
dissimalirities "had to be" NA as well.
Had he used
    pam(all, 10)
    pam(all, 10, do.swap = FALSE)

he would have got the error message

   "No clustering performed, NAs in the computed dissimilarity matrix."

But because of  'cluster.only=TRUE' 
*and* because of a lapsus of the 'cluster' maintainer (me),
pam()  returned without the error message in this case.

The next release of R (or of 'cluster') will give the error
message also in the case of 'cluster.only=TRUE' .

Martin Maechler, ETH Zurich

    > I'm using R version 2.13.0 (2011-04-13) on Platform:
    > x86_64-unknown-linux-gnu (64-bit) and have cluster_1.13.3.

    > --------------------------------------
    > Dario Strbenac
    > Research Assistant
    > Cancer Epigenetics
    > Garvan Institute of Medical Research
    > Darlinghurst NSW 2010
    > Australia