Ensemble learning using three-way density-sensitive spectral clustering

被引:25
作者
Fan, Jiachen [1 ]
Wang, Pingxin [2 ]
Jiang, Chunmao [3 ]
Yang, Xibei [1 ]
Song, Jingjing [1 ]
机构
[1] Jiangsu Univ Sci & Technol, Sch Comp, Zhenjiang 212003, Jiangsu, Peoples R China
[2] Jiangsu Univ Sci & Technol, Sch Sci, Zhenjiang 212003, Jiangsu, Peoples R China
[3] Harbin Normal Univ, Coll Comp Sci & Informat Engn, Harbin 150025, Heilongjiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Spectral clustering; Density-sensitive; Three-way clustering; Clustering ensemble; ROUGH SET; DECISION; CLASSIFICATION; REGIONS;
D O I
10.1016/j.ijar.2022.07.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As one popular clustering algorithm in the last few years, spectral clustering is advantageous over most existing clustering algorithms. Although spectral clustering can perform well in many instances, the algorithm still has some problems. The clusters obtained by spectral clustering have crisp boundaries, which cannot reflect the fact that one cluster may not have a well-defined boundary in the real situations. Furthermore, the frequently-used distance measures in spectral clustering cannot satisfy both global and local consistency, especially for the data with multi-scale. In order to address the above limitations, we firstly present a three-way density-sensitive spectral clustering algorithm, which uses the core region and the fringe region to represent a cluster. In the proposed algorithm, we use density-sensitive distance to produce a similarity matrix, which can well capture the real data structures. An overlap clustering is introduced to obtain the upper bound (unions of the core regions and the fringe regions) of each cluster and perturbation analysis is applied to separate the core regions from the upper bounds. The fringe region of the specific cluster is the differences between the upper bound and the core region. Because a single clustering algorithm cannot always achieve a good clustering result, we develop an improved ensemble three-way spectral clustering algorithm based on ensemble strategy. The proposed ensemble algorithm randomly extracts feature subset of sample and uses the three-way density-sensitive clustering algorithm to obtain the diverse base clustering results. Based on the base clustering results, voting method is used to generate a three-way clustering result. The experimental results show that the three-way density-sensitive clustering algorithm can well explain the data structure and maintain a good clustering performance at the same time, and the ensemble three-way density-sensitive spectral clustering can improve the robustness and stability of clustering results. (C) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页码:70 / 84
页数:15
相关论文
共 64 条
[1]   A three-way clustering approach for handling missing data using GTRS [J].
Afridi, Mohammad Khan ;
Azam, Nouman ;
Yao, JingTao ;
Alanazi, Eisa .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2018, 98 :11-24
[2]   A spatial filtering inspired three-way clustering approach with application to outlier detection [J].
Ali, Bahar ;
Azam, Nouman ;
Shah, Anwar ;
Yao, JingTao .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2021, 130 :1-21
[3]  
Bezdek J. C., 1981, Pattern recognition with fuzzy objective function algorithms
[4]   Neighborhood rough set-based three-way clustering considering attribute correlations: An approach to classification of potential gout groups [J].
Chu, Xiaoli ;
Sun, Bingzhen ;
Li, Xue ;
Han, Keyu ;
Wu, JiaQi ;
Zhang, Yan ;
Huang, Qingchun .
INFORMATION SCIENCES, 2020, 535 :28-41
[5]  
Fanti C, 2004, ADV NEUR IN, V16, P1603
[6]   Combining multiple clusterings using evidence accumulation [J].
Fred, ALN ;
Jain, AK .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (06) :835-850
[7]   Improving awareness in early stages of security analysis: A zone partition method based on GrC [J].
Fujita, Hamido ;
Gaeta, Angelo ;
Loia, Vincenzo ;
Orciuoli, Francesco .
APPLIED INTELLIGENCE, 2019, 49 (03) :1063-1077
[8]   Incremental updating probabilistic neighborhood three-way regions with time-evolving attributes [J].
Hu, Chengxiang ;
Zhang, Li .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2020, 120 :1-23
[9]   Ensemble clustering using factor graph [J].
Huang, Dong ;
Lai, Jianhuang ;
Wang, Chang-Dong .
PATTERN RECOGNITION, 2016, 50 :131-142
[10]   An automatic three-way clustering method based on sample similarity [J].
Jia, Xiuyi ;
Rao, Ya ;
Li, Weiwei ;
Yang, Sichun ;
Yu, Hong .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (05) :1545-1556