Gaussian Mixture Model Clustering with Incomplete Data

被引:34
|
作者
Zhang, Yi [1 ]
Li, Miaomiao [1 ,2 ]
Wang, Siwei [1 ]
Dai, Sisi [1 ]
Luo, Lei [1 ]
Zhu, En [1 ]
Xu, Huiying [3 ,4 ]
Zhu, Xinzhong [3 ]
Yao, Chaoyun [5 ]
Zhou, Haoran [6 ]
机构
[1] NUDT, Sch Comp, Changsha, Peoples R China
[2] Changsha Univ, Changsha, Hunan, Peoples R China
[3] Zhejiang Normal Univ, Coll Math & Comp Sci, Hangzhou, Zhejiang, Peoples R China
[4] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[5] NUDT, Lab Complex Electromagnet Environm Effects Elect, Changsha, Peoples R China
[6] Chongqing Univ Technol, Chongqing, Peoples R China
基金
中国国家自然科学基金;
关键词
GMM; clustering; EM; incomplete data;
D O I
10.1145/3408318
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gaussian mixturemodel (GMM) clustering has been extensively studied due to its effectiveness and efficiency. Though demonstrating promising performance in various applications, it cannot effectively address the absent features among data, which is not uncommon in practical applications. In this article, different from existing approaches that first impute the absence and then perform GMM clustering tasks on the imputed data, we propose to integrate the imputation and GMM clustering into a unified learning procedure. Specifically, the missing data is filled by the result of GMM clustering, and the imputed data is then taken for GMM clustering. These two steps alternatively negotiate with each other to achieve optimum. By this way, the imputed data can best serve for GMM clustering. A two-step alternative algorithm with proved convergence is carefully designed to solve the resultant optimization problem. Extensive experiments have been conducted on eight UCI benchmark datasets, and the results have validated the effectiveness of the proposed algorithm.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Assessing clustering strategies for Gaussian mixture filtering a subsurface contaminant model
    Liu, B.
    Gharamti, M. E.
    Hoteit, I.
    JOURNAL OF HYDROLOGY, 2016, 535 : 1 - 21
  • [22] Color Detection and Segmentation of the Scene Based on Gaussian Mixture Model Clustering
    Ye, Huiying
    Zheng, Lin
    Liu, Pengfei
    PROCEEDINGS OF 2017 IEEE 7TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC), 2017, : 503 - 506
  • [23] A Spatial Gaussian Mixture Model for Optical Remote Sensing Image Clustering
    Zhao, Bei
    Zhong, Yanfei
    Ma, Ailong
    Zhang, Liangpei
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2016, 9 (12) : 5748 - 5759
  • [24] Gaussian Mixture Reduction via Clustering
    Schieferdecker, Dennis
    Huber, Marco F.
    FUSION: 2009 12TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, VOLS 1-4, 2009, : 1536 - +
  • [25] A particular Gaussian mixture model for clustering and its application to image retrieval
    Sahbi, Hichem
    SOFT COMPUTING, 2008, 12 (07) : 667 - 676
  • [26] On Fuzzy Non-Metric Model for Data with Tolerance and its Application to Incomplete Data Clustering
    Endo, Yasunori
    Suzuki, Tomoyuki
    Kinoshita, Naohiko
    Hamasuna, Yukihiro
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2016, 20 (04) : 571 - 579
  • [27] Gaussian kernels for incomplete data
    Mesquita, Diego P. P.
    Gomes, Joao P. P.
    Corona, Francesco
    Souza Junior, Amauri H.
    Nobre, Juvencio S.
    APPLIED SOFT COMPUTING, 2019, 77 : 356 - 365
  • [28] A partial order framework for incomplete data clustering
    Hamdi Yahyaoui
    Hosam AboElfotoh
    Yanjun Shu
    Applied Intelligence, 2023, 53 : 7439 - 7454
  • [29] Reject Inference of Incomplete Data Using a Normal Mixture Model
    Song, Juwon
    KOREAN JOURNAL OF APPLIED STATISTICS, 2011, 24 (02) : 425 - 433
  • [30] On the parameterized complexity of clustering problems for incomplete data
    Eiben, Eduard
    Ganian, Robert
    Kanj, Iyad
    Ordyniak, Sebastian
    Szeider, Stefan
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2023, 134 : 1 - 19