Interpretable Clustering via Soft Clustering Trees

Times cited: 0
Authors
Cohen, Eldan [1]
Affiliations
[1] Univ Toronto, Toronto, ON, Canada
Source
INTEGRATION OF CONSTRAINT PROGRAMMING, ARTIFICIAL INTELLIGENCE, AND OPERATIONS RESEARCH, CPAIOR 2023 | 2023 / Vol. 13884
Keywords
DECISION TREE
DOI
10.1007/978-3-031-33271-5_19
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Clustering is a popular unsupervised learning task that consists of finding a partition of the data points that groups similar points together. Despite its popularity, most state-of-the-art algorithms do not provide any explanation of the obtained partition, making it hard to interpret. In recent years, several works have considered using decision trees to construct clusters that are inherently interpretable. However, these approaches do not scale to large datasets, do not account for uncertainty in results, and do not support advanced clustering objectives such as spectral clustering. In this work, we present soft clustering trees, an interpretable clustering approach that is based on soft decision trees that provide probabilistic cluster membership. We model soft clustering trees as a continuous optimization problem that is amenable to efficient optimization techniques. Our approach is designed to output highly sparse decision trees to increase interpretability and to support tree-based spectral clustering. Extensive experiments show that our approach can produce clustering trees of significantly higher quality compared to the state-of-the-art and scale to large datasets.
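
As a rough illustration of what "probabilistic cluster membership" from a soft decision tree can look like, here is a minimal sketch. It is not the paper's formulation, which this record does not give. It assumes a complete binary tree whose internal nodes route a point to the right child with probability sigmoid(w·x + b), with one leaf per cluster. The function name `soft_tree_membership` and all parameter values are hypothetical, and the single-feature weight vectors are only meant to mimic a sparse tree.

```python
import math


def sigmoid(z):
    """Logistic function mapping a real score to a probability."""
    return 1.0 / (1.0 + math.exp(-z))


def soft_tree_membership(x, weights, biases, depth):
    """Return leaf (cluster) probabilities for point x in a complete soft tree.

    Internal nodes are stored in level order (node 0 is the root, nodes 1-2
    are its children, and so on). Each node sends x to its right child with
    probability sigmoid(w . x + b) and to its left child otherwise.
    """
    probs = [1.0]  # probability of reaching each node on the current level
    node = 0
    for _ in range(depth):
        next_probs = []
        for p in probs:
            z = sum(w * xi for w, xi in zip(weights[node], x)) + biases[node]
            go_right = sigmoid(z)
            next_probs.append(p * (1.0 - go_right))  # left child
            next_probs.append(p * go_right)          # right child
            node += 1
        probs = next_probs
    return probs


# Hypothetical depth-2 tree: 3 internal nodes, 4 leaves (clusters).
# Each node looks at a single feature, as an assumed stand-in for sparsity.
weights = [[4.0, 0.0], [0.0, 4.0], [0.0, 4.0]]
biases = [-2.0, -2.0, -2.0]

boundary_point = soft_tree_membership([0.5, 0.5], weights, biases, depth=2)
clear_point = soft_tree_membership([2.0, 0.0], weights, biases, depth=2)
print(boundary_point)  # uniform: the point sits on every split boundary
print(clear_point)     # most mass on the leaf reached by right, then left
```

Because each node splits its incoming probability between its two children, the leaf probabilities always sum to one, so a point near a split boundary receives an uncertain, spread-out membership instead of a hard assignment.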
Pages: 281-298
Page count: 18