Document clustering for R
Hi, We discovered that the package "amap" contain a distance calculation function call Dist which can calculate the distance according to a method call "pearson" which is in fact the "not centered Pearson" which seems to be the cosine distance. Could you tell me what do you think on that? Best regards, David
On Sep 12, 2005, at 21:47, Raymond K Pon wrote:
I'm working on a project related to document clustering. I know that R has clustering algorithms such as clara, but only supports two distance metrics: euclidian and manhattan, which are not very useful for clustering documents. I was wondering how easy it would be to extend the clustering package in R to support other distance metrics, such as cosine distance, or if there was an API for custom distance metrics. Best regards, Raymond Pon pon3 at llnl.gov x43062
______________________________________________ R-help at stat.math.ethz.ch mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html