k-means clustering for persistent homology

被引:0
作者
Cao, Yueqi [1 ]
Leung, Prudence [1 ]
Monod, Anthea [1 ]
机构
[1] Imperial Coll London, Dept Math, South Kensington Campus, London SW7 2AZ, England
关键词
Alexandrov geometry; Karush-Kuhn-Tucker optimization; k-means clustering; Persistence diagrams; Persistent homology; SIZE FUNCTIONS; STATISTICS; SHAPE;
D O I
10.1007/s11634-023-00578-y
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Persistent homology is a methodology central to topological data analysis that extracts and summarizes the topological features within a dataset as a persistence diagram. It has recently gained much popularity from its myriad successful applications to many domains, however, its algebraic construction induces a metric space of persistence diagrams with a highly complex geometry. In this paper, we prove convergence of the k-means clustering algorithm on persistence diagram space and establish theoretical properties of the solution to the optimization problem in the Karush-Kuhn-Tucker framework. Additionally, we perform numerical experiments on both simulated and real data of various representations of persistent homology, including embeddings of persistence diagrams as well as diagrams themselves and their generalizations as persistence measures. We find that k-means clustering performance directly on persistence diagrams and measures outperform their vectorized representations.
引用
收藏
页码:95 / 119
页数:25
相关论文
共 52 条
[1]  
Adams H, 2017, J MACH LEARN RES, V18
[2]  
Arthur D, 2007, PROCEEDINGS OF THE EIGHTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P1027
[3]   Homological persistence in time series: an application to music classification [J].
Bergomi, Mattia G. ;
Barate, Adriano .
JOURNAL OF MATHEMATICS AND MUSIC, 2020, 14 (02) :204-221
[4]   Persistent Homology for Path Planning in Uncertain Environments [J].
Bhattacharya, Subhrajit ;
Ghrist, Robert ;
Kumar, Vijay .
IEEE TRANSACTIONS ON ROBOTICS, 2015, 31 (03) :578-590
[5]  
Billard L, 2000, ST CLASS DAT ANAL, P369
[6]  
Blanchard M, 2022, ARXIV
[7]  
Boyd Stephen, 2004, Convex Optimization
[8]  
Bubenik P, 2015, J MACH LEARN RES, V16, P77
[9]   Centrosymmetric stochastic matrices [J].
Cao, Lei ;
McLaren, Darian ;
Plosker, Sarah .
LINEAR & MULTILINEAR ALGEBRA, 2022, 70 (03) :449-464
[10]  
Chazal F., 2016, The Structure and Stability of Persistence Modules, DOI DOI 10.1007/978-3-319-42545-0