Semi-supervised discriminative clustering with graph regularization

被引:17
|
作者
Smieja, Marek [1 ]
Myronov, Oleksandr [2 ]
Tabor, Jacek [1 ]
机构
[1] Jagiellonian Univ, Fac Math & Comp Sci, Lojasiewicza 6, PL-30348 Krakow, Poland
[2] Ardigen SA, Bobrzynskiego 14, PL-30348 Krakow, Poland
关键词
Semi-supervised clustering; Discriminative model; Pairwise constraints; Graph clustering;
D O I
10.1016/j.knosys.2018.03.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pairwise constraints are a typical form of class information used in semi-supervised clustering. Although various methods were proposed to combine unlabeled data with pairwise constraints, most of them rely on adapting existing clustering frameworks, such as GMM or k-means, to semi-supervised setting. In consequence, pairwise relations have to be transferred into particular clustering model, which is often contradictory with expert knowledge. In this paper we propose a novel semi-supervised method, d-graph, which does not assume any pre-defined structure of clusters. We follow a discriminative approach and use logistic function to directly model posterior probabilities p(k/x) that point x belongs to kth cluster. Making use of these posterior probabilities we maximize the expected probability that pairwise constraints are preserved. To include unlabeled data in our clustering objective function, we introduce additional pairwise constraints so that nearby points are more likely to appear in the same cluster. The proposed model can be easily optimized with the use of gradient techniques and kernelized, which allows to discover arbitrary shapes and structures in data. The experimental results performed on various types of data demonstrate that d-graph obtains better clustering results than comparative state-of-the-art methods. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:24 / 36
页数:13
相关论文
共 50 条
  • [1] A survey on semi-supervised graph clustering
    Daneshfar, Fatemeh
    Soleymanbaigi, Sayvan
    Yamini, Pedram
    Amini, Mohammad Sadra
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133 (133)
  • [2] Semi-supervised graph clustering: a kernel approach
    Brian Kulis
    Sugato Basu
    Inderjit Dhillon
    Raymond Mooney
    Machine Learning, 2009, 74 : 1 - 22
  • [3] Semi-supervised graph clustering: a kernel approach
    Kulis, Brian
    Basu, Sugato
    Dhillon, Inderjit
    Mooney, Raymond
    MACHINE LEARNING, 2009, 74 (01) : 1 - 22
  • [4] Semi-supervised clustering with discriminative random fields
    Chang, Chin-Chun
    Chen, Hsin-Yi
    PATTERN RECOGNITION, 2012, 45 (12) : 4402 - 4413
  • [5] Consistency regularization for deep semi-supervised clustering with pairwise constraints
    Dan Huang
    Jie Hu
    Tianrui Li
    Shengdong Du
    Hongmei Chen
    International Journal of Machine Learning and Cybernetics, 2022, 13 : 3359 - 3372
  • [6] Adaptive and structured graph learning for semi-supervised clustering
    Chen, Long
    Zhong, Zhi
    INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (04)
  • [7] Consistency regularization for deep semi-supervised clustering with pairwise constraints
    Huang, Dan
    Hu, Jie
    Li, Tianrui
    Du, Shengdong
    Chen, Hongmei
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (11) : 3359 - 3372
  • [8] An efficient semi-supervised graph based clustering
    Viet-Vu Vu
    INTELLIGENT DATA ANALYSIS, 2018, 22 (02) : 297 - 307
  • [9] Semi-supervised fuzzy clustering with metric learning and entropy regularization
    Yin, Xuesong
    Shu, Ting
    Huang, Qi
    KNOWLEDGE-BASED SYSTEMS, 2012, 35 : 304 - 311
  • [10] Effective semi-supervised graph clustering with pairwise constraints
    Chen, Jingwei
    Xie, Shiyu
    Yang, Hui
    Nie, Feiping
    INFORMATION SCIENCES, 2024, 681