
Branch Ratios & other Indexes for Trees/Dendrograms

1 message · Leonard Mada

Dear R Users,

This is a follow-up to one of my earlier messages (linked at the end). I did not receive useful suggestions back then, so I did some research on my own and found an R package and some references in the literature.

### Tree Balance

The problem can be addressed with indexes that measure tree balance. The package mdendro implements two such indexes: a chaining coefficient and a tree balance coefficient, the latter being essentially an entropy of the tree structure.
https://cran.r-project.org/web/packages/mdendro/index.html
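A minimal sketch of how the two indexes might be read from an mdendro result. It assumes the package is installed and that the object returned by linkage() exposes the components cc (chaining coefficient) and tb (tree balance), as described in the package documentation. The USArrests data set and the "arithmetic" method are only illustrative choices.

```r
# Sketch only: runs the example if the CRAN package mdendro is installed.
if (requireNamespace("mdendro", quietly = TRUE)) {
  d <- dist(scale(USArrests))                 # example data shipped with R
  lnk <- mdendro::linkage(d, method = "arithmetic")
  # Assumed component names, per the package documentation:
  #   cc = chaining coefficient, tb = tree balance
  print(c(chaining = lnk$cc, balance = lnk$tb))
}
```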

The paper referenced by mdendro has been published in the meantime (it was not accessible a few days ago):
Fernández, A., & Gómez, S. (2025). mdendro: An R Package for Extended Agglomerative Hierarchical Clustering. Journal of Statistical Software, 114(2), 1–26. https://doi.org/10.18637/jss.v114.i02


The other reference is much older (and I do not have access to it):
Williams W, Lambert J, Lance G (1966). "Multivariate Methods in Plant Ecology: V. Similarity Analyses and Information-Analysis." Journal of Ecology, 54(2), 427–445. doi:10.2307/2257960.
[It may be possible to read the article online, but not to download it; I have not fully checked.]

I have included some functions in my own R code on GitHub, like index.chaining and index.entropy:
https://github.com/discoleo/PeptideClassifier/blob/main/R/Helper.Tree.Analysis.R
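For readers who only want to see the general idea, here is a minimal base-R sketch of an entropy-style balance measure on an hclust tree. It is not the code from the GitHub file above and not the mdendro implementation, so the numbers are not expected to match either. The function names split_sizes and mean_split_entropy are made up for this illustration, and the measure (plain mean of the two-way split entropy over all internal nodes) is an assumption chosen for simplicity.

```r
# Sizes of the two subtrees joined at every merge of an hclust object.
# In hc$merge, negative entries are single observations (leaves) and
# positive entries refer to the cluster formed at that earlier row.
split_sizes <- function(hc) {
  m <- hc$merge
  n <- nrow(m)
  size  <- integer(n)   # number of leaves below each internal node
  left  <- integer(n)
  right <- integer(n)
  for (i in seq_len(n)) {
    left[i]  <- if (m[i, 1] < 0) 1L else size[m[i, 1]]
    right[i] <- if (m[i, 2] < 0) 1L else size[m[i, 2]]
    size[i]  <- left[i] + right[i]
  }
  cbind(left = left, right = right)
}

# Mean Shannon entropy (base 2) of the left/right split over all internal
# nodes. A value of 1 means every split is 50/50; lower values indicate
# a more chained (caterpillar-like) tree.
mean_split_entropy <- function(hc) {
  s <- split_sizes(hc)
  p <- s[, "left"] / rowSums(s)
  h <- -(p * log2(p) + (1 - p) * log2(1 - p))
  mean(h)
}

# Two small deterministic examples on one-dimensional data.
x_balanced <- c(0, 1, 10, 11, 100, 101, 110, 111)  # nested pairs
x_chained  <- 2^(0:7)                               # each point joins the previous chain
hc_balanced <- hclust(dist(x_balanced), method = "single")
hc_chained  <- hclust(dist(x_chained),  method = "single")

print(mean_split_entropy(hc_balanced))
print(mean_split_entropy(hc_chained))
```

The balanced example yields exactly 1 because every merge joins two clusters of equal size; the chained example gives a smaller value. The measure is not normalized for the number of leaves, which a real index would probably need.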

I hope this info is useful to others as well.

Sincerely,

Leonard

Initial Message:
https://stat.ethz.ch/pipermail/r-help/2025-August/481164.html