Hierarchical Cluster Analysis with large dataset
(Offlist, since this is just a personal comment). I cannot help you -- but it sounds like the sort of thing that you should look for on the BioconductoR list. But I wonder how you could possibly interpret the results even if you could get them. I would think they would be more noise than signal, and making sense of such a mess would be hopeless. Maybe you need to rethink your approach. No need to respond to me, of course. Cheers, Bert On Sun, Nov 3, 2013 at 1:42 AM, Petar Milin
<petar.milin at uni-tuebingen.de> wrote:
Hello!
Can anyone give me advice on running Hierarchical Cluster Analysis on large
datasets? For example, 80000x10000. Calculating distances on such a
dataframe seems impossible even on very powerful computer.
Also, any other advice that would lead to reduction of dimensionality,
i.e., cluster/group variables would be more than welcomed.
Many thanks,
PM
[[alternative HTML version deleted]]
______________________________________________ R-help at r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Bert Gunter Genentech Nonclinical Biostatistics (650) 467-7374