Hierarchical Cluster Analysis with large dataset
Hi, I think your dataset is too large to be interpretable, but in general you should check out the cluster package, specifically clara(), which is intended for use with large data. Sarah On Sun, Nov 3, 2013 at 4:42 AM, Petar Milin
<petar.milin at uni-tuebingen.de> wrote:
Hello! Can anyone give me advice on running Hierarchical Cluster Analysis on large datasets? For example, 80000x10000. Calculating distances on such a dataframe seems impossible even on very powerful computer. Also, any other advice that would lead to reduction of dimensionality, i.e., cluster/group variables would be more than welcomed. Many thanks, PM
Sarah Goslee http://www.functionaldiversity.org