Debiasing weighted multi-view k-means clustering based on causal regularization

被引:0
作者
Huang, Xiuqi [1 ]
Tao, Hong [1 ]
Ni, Haotian [1 ]
Hou, Chenping [1 ]
机构
[1] Natl Univ Def Technol, Coll Sci, Changsha 410073, Peoples R China
关键词
Multi-view; Clustering; Covariate balance; Causal regularization; SAMPLE SELECTION BIAS; INFERENCE; MODELS;
D O I
10.1016/j.patcog.2024.111195
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the field of unsupervised learning, many methods such as clustering rely on exploring the correlations among features. However, considering these correlations is not always advantageous for learning models. The biased selection of data may lead to redundant and unstable correlations among features, adversely affecting the performance of learning models. Multi-view data presents more complex feature correlations with potential redundancy and varying distributions across views, necessitating detailed analysis. This paper proposes a causal regularized debiased multi-view k-means clustering (DMKC) method to counteract redundant feature correlations stemming from sample selection bias. This method introduces a covariate weighted balance method from causal inference to mitigate redundant bias in multi-view clustering by adjusting sample weights. The approach combines sample and view weights within a k-means loss framework, effectively eliminating feature redundancy and enhancing clustering performance amidst sample selection bias. The optimization process of the relevant parameters is detailed in this paper, and comprehensive experiments demonstrate the effectiveness of the method.
引用
收藏
页数:12
相关论文
共 48 条
  • [1] Approximate residual balancing: debiased inference of average treatment effects in high dimensions
    Athey, Susan
    Imbens, Guido W.
    Wager, Stefan
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2018, 80 (04) : 597 - 623
  • [2] Cai X., 2013, P 23 INT JOINT C ART, P2598
  • [3] Counterfactuals and causability in explainable artificial intelligence: Theory, algorithms, and applications
    Chou, Yu-Liang
    Moreira, Catarina
    Bruza, Peter
    Ouyang, Chun
    Jorge, Joaquim
    [J]. INFORMATION FUSION, 2022, 81 : 59 - 83
  • [4] Cortes C, 2008, LECT NOTES ARTIF INT, V5254, P38, DOI 10.1007/978-3-540-87987-9_8
  • [5] Stable learning establishes some common ground between causal inference and machine learning
    Cui, Peng
    Athey, Susan
    [J]. NATURE MACHINE INTELLIGENCE, 2022, 4 (02) : 110 - 115
  • [6] Auto-attention mechanism for multi-view deep emb e dding clustering
    Diallo, Bassoma
    Hu, Jie
    Li, Tianrui
    Khan, Ghufran Ahmad
    Liang, Xinyan
    Wang, Hongjun
    [J]. PATTERN RECOGNITION, 2023, 143
  • [7] Ding C, 2005, LECT NOTES ARTIF INT, V3720, P530, DOI 10.1007/11564096_51
  • [8] Generalizing Graph Neural Networks on Out-of-Distribution Graphs
    Fan, Shaohua
    Wang, Xiao
    Shi, Chuan
    Cui, Peng
    Wang, Bai
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (01) : 322 - 337
  • [9] Landmark-based k-factorization multi-view subspace clustering
    Fang, Yuan
    Yang, Geping
    Chen, Xiang
    Gong, Zhiguo
    Yang, Yiyang
    Chen, Can
    Hao, Zhifeng
    [J]. INFORMATION SCIENCES, 2024, 667
  • [10] A Review on Ensembles for the Class Imbalance Problem: Bagging-, Boosting-, and Hybrid-Based Approaches
    Galar, Mikel
    Fernandez, Alberto
    Barrenechea, Edurne
    Bustince, Humberto
    Herrera, Francisco
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (04): : 463 - 484