Renyi entropy driven hierarchical graph clustering

被引:0
作者
Oggier F. [1 ]
Datta A. [2 ]
机构
[1] Division of Mathematical Sciences, Nanyang Technological University, Singapore
[2] School of Computer Engineering, Nanyang Technological University, Singapore
关键词
Graph clustering; Renyi entropy;
D O I
10.7717/PEERJ-CS.366
中图分类号
学科分类号
摘要
This article explores a graph clustering method that is derived from an information theoretic method that clusters points in ℝn relying on Renyi entropy, which involves computing the usual Euclidean distance between these points. Two view points are adopted: (1) the graph to be clustered is first embedded into ℝd for some dimension d so as to minimize the distortion of the embedding, then the resulting points are clustered, and (2) the graph is clustered directly, using as distance the shortest path distance for undirected graphs, and a variation of the Jaccard distance for directed graphs. In both cases, a hierarchical approach is adopted, where both the initial clustering and the agglomeration steps are computed using Renyi entropy derived evaluation functions. Numerical examples are provided to support the study, showing the consistency of both approaches (evaluated in terms of F-scores). © 2021 Oggier and Datta. All Rights Reserved.
引用
收藏
页码:1 / 19
页数:18
相关论文
共 28 条
  • [1] Andersen MS, Dahl J, Vandenberghe L., Cvxopt: a python package for convex optimization, (2013)
  • [2] Bourgain J., On lipschitz embeddings of finite metric spaces in hilbert space, Israel Journal of Mathematics, 52, 1-2, pp. 46-52, (1985)
  • [3] Countinho J., Montreal street gangs, (2016)
  • [4] Diamond S, Boyd S., Cvxpy: a python-embedded modeling language for convex optimization, Journal of Machine Learning Research, 17, 83, pp. 2909-2913, (2016)
  • [5] Faivishevsky L, Goldberger J., A nonparametricinformation theoretic clustering algorithmIn, International Conference on Machine Learning (ICML), (2010)
  • [6] Fruchterman T, Reingold E., Graph drawing by force-directed placement, Journal of, (1991)
  • [7] Software: Practice and Experience, 21, 11, pp. 1129-1164
  • [8] Gokcay E, Principe J., Information theoretic clustering, IEEE Transactions on Pattern Analysis and Machine Intelligence, (2002)
  • [9] Hartigan JA., Clustering algorithms, (1975)
  • [10] Jaccard P., The distribution of the flora in the alpine zone, New Phytologist, 11, 2, pp. 37-50, (1912)