Efficient Constrained K-center Clustering with Background Knowledge

被引:0
作者
Guo, Longkun [1 ,2 ]
Jia, Chaoqi [2 ]
Liao, Kewen [3 ]
Lu, Zhigang [4 ]
Xue, Minhui [5 ]
机构
[1] Fuzhou Univ, Sch Math & Stat, Fuzhou 350116, Peoples R China
[2] Shandong Acad Sci, Qilu Univ Technol, Shandong Comp Sci Ctr, Key Lab Comp Power Network & Informat Secur,Mini, Jinan 250316, Peoples R China
[3] Australian Catholic Univ, Peter Faber Business Sch, HilstLab, Sydney 2060, Australia
[4] James Cook Univ, Coll Sci & Engn, Townsville 2060, Australia
[5] CSIROs Data61, Sydney 2015, Australia
来源
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 18 | 2024年
基金
美国国家科学基金会;
关键词
INSTANCE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Center-based clustering has attracted significant research interest from both theory and practice. In many practical applications, input data often contain background knowledge that can be used to improve clustering results. In this work, we build on widely adopted k-center clustering and model its input background knowledge as must-link (ML) and cannot-link (CL) constraint sets. However, most clustering problems including k-center are inherently NP-hard, while the more complex constrained variants are known to suffer severer approximation and computation barriers that significantly limit their applicability. By employing a suite of techniques including reverse dominating sets, linear programming (LP) integral polyhedron, and LP duality, we arrive at the first efficient approximation algorithm for constrained k-center with the best possible ratio of 2. We also construct competitive baseline algorithms and empirically evaluate our approximation algorithm against them on a variety of real datasets. The results validate our theoretical findings and demonstrate the great advantages of our algorithm in terms of clustering cost, clustering quality, and running time.
引用
收藏
页码:20709 / 20717
页数:9
相关论文
empty
未找到相关数据