Skip to content
Prev 332570 / 398506 Next

Hierarchical Cluster Analysis with large dataset

On Sun, 3 Nov 2013 10:42:06 +0100 Petar Milin
<petar.milin at uni-tuebingen.de> wrote:

            
You have two different issues here: size of dataset (number of
observations which prevents storage in memory of the distance matrix)
and number of variables (which does not, but probably hinders reading
in the dataset.

You need to provide more information here: why do you need/want to do
hierarchical clustering, if so, do you only need to use R. What
hardware you have at your disposal, etc.

Depending on your answers to the above, this may well be a research
problem in its own right.

HTH!

Best wishes,
Ranjan