Adaptive kernel fuzzy clustering for missing data

Cited by: 3
Authors
Rodrigues, Anny K. G. [1]
Ospina, Raydonal [1]
Ferreira, Marcelo R. P. [2]
Affiliations
[1] Univ Fed Pernambuco, CCEN, Dept Estat, CASTLab, Recife, PE, Brazil
[2] Univ Fed Paraiba, Ctr Ciencias Exatas & Nat, Dept Estat, DataLab, Joao Pessoa, Paraiba, Brazil
Source
PLOS ONE | 2021 / Volume 16 / Issue 11
Keywords
MULTIPLE IMPUTATION; ALGORITHM; FRAMEWORK; VALUES;
DOI
10.1371/journal.pone.0259266
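Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification codes
07 ; 0710 ; 09 ;
Abstract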
Many machine learning procedures, including clustering analysis, are often affected by missing values. This work proposes and evaluates a Kernel Fuzzy C-means clustering algorithm based on the kernelization of the metric with local adaptive distances (VKFCM-K-LP) under three strategies for dealing with missing data. The first strategy, called Whole Data Strategy (WDS), performs clustering only on the complete part of the dataset, i.e., it discards all instances with missing data. The second approach uses the Partial Distance Strategy (PDS), in which partial distances are computed over all observed features and then re-scaled by the reciprocal of the proportion of observed values. The third technique, called Optimal Completion Strategy (OCS), computes missing values iteratively as auxiliary variables in the optimization of a suitable objective function. The clustering results were evaluated according to different metrics. The best performance of the clustering algorithm was achieved under the PDS and OCS strategies. Under the OCS approach, new datasets were derived and the missing values were estimated dynamically in the optimization process. Clustering under the OCS strategy also outperformed the clusters obtained by applying the VKFCM-K-LP algorithm to a version of the data in which missing values were imputed beforehand by the mean or the median of the observed values.
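The WDS and PDS ideas described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it uses a plain squared Euclidean distance as a stand-in for the kernelized adaptive distance of VKFCM-K-LP, encodes missing entries as NaN, and the function names `whole_data_subset` and `partial_sq_distance` are hypothetical.

```python
import math


def whole_data_subset(rows):
    """WDS-style filter: keep only instances with no missing (NaN) entries."""
    return [row for row in rows if not any(math.isnan(a) for a in row)]


def partial_sq_distance(x, v):
    """PDS-style distance between an instance x and a prototype v.

    Sums squared differences over the observed entries of x only, then
    rescales by the reciprocal of the proportion of observed values,
    i.e. multiplies by len(x) / (number of observed entries).
    Assumption: plain squared Euclidean distance, not the kernelized
    adaptive distance used in the paper.
    """
    observed = [(xj, vj) for xj, vj in zip(x, v) if not math.isnan(xj)]
    if not observed:
        raise ValueError("instance has no observed values")
    partial = sum((xj - vj) ** 2 for xj, vj in observed)
    return partial * len(x) / len(observed)


# Illustrative data (made up for this example).
data = [
    [1.0, math.nan, 3.0, 5.0],
    [0.5, 2.0, 1.5, 4.0],
]
prototype = [0.0, 2.0, 1.0, 5.0]

complete_rows = whole_data_subset(data)                  # drops the first row
distance = partial_sq_distance(data[0], prototype)       # (1 + 4 + 0) * 4 / 3
```

With a fully observed instance the rescaling factor is 1, so the function reduces to the ordinary squared Euclidean distance. The OCS strategy is not shown here, since it depends on the update equations of the specific objective function.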
Pages: 33
Related papers
50 in total
  • [41] Adaptive Missing Texture Reconstruction Method Based on Kernel Canonical Correlation Analysis with a New Clustering Scheme
    Ogawa, Takahiro
    Haseyama, Miki
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2009, E92A (08) : 1950 - 1960
  • [42] CLUSTERING OF DATA WITH MISSING ENTRIES
    Poddar, Sunrita
    Jacob, Mathews
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : CP46 - CP46
  • [43] A kernel extension to handle missing data
    Nebot-Troyano, Guillermo
    Belanche-Munoz, Lluis A.
    RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXVI: INCORPORATING APPLICATIONS AND INNOVATIONS IN INTELLIGENT SYSTEMS XVII, 2010, : 165 - 178
  • [44] Adaptive approach to fuzzy clustering
    Yue, Shi-Hong
    Li, Ping
    Song, Zhi-Huan
    Gu, Ying-Kun
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2004, 38 (10) : 1280 - 1284
  • [45] Differentiated treatment of missing values in fuzzy clustering
    Timm, H
    Döring, C
    Kruse, R
    FUZZY SETS AND SYSTEMS - IFSA 2003, PROCEEDINGS, 2003, 2715 : 354 - 361
  • [46] Nonparametric Fisher kernel using fuzzy clustering
    Inokuchi, Ryo
    Miyamoto, Sadaaki
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 2, PROCEEDINGS, 2006, 4252 : 78 - 85
  • [47] A kernel-based fuzzy clustering algorithm
    Wang, Jiun-Hau
    Lee, Wan-Jui
    Lee, Shie-Jue
    ICICIC 2006: FIRST INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING, INFORMATION AND CONTROL, VOL 1, PROCEEDINGS, 2006, : 550 - +
  • [48] Transfer Learning Based Kernel Fuzzy Clustering
    Dang, Bozhan
    Zhou, Jin
    Liu, Xiangdao
    Wang, Rongrong
    Wang, Lin
    Han, Shiyuan
    Chen, Yuehui
    2019 INTERNATIONAL CONFERENCE ON FUZZY THEORY AND ITS APPLICATIONS (IFUZZY), 2019, : 21 - 25
  • [49] Performance of kernel-based fuzzy clustering
    Graves, D.
    Pedrycz, W.
    ELECTRONICS LETTERS, 2007, 43 (25) : 1445 - 1446
  • [50] Fuzzy Clustering-Based Adaptive Regression for Drifting Data Streams
    Song, Yiliao
    Lu, Jie
    Lu, Haiyan
    Zhang, Guangquan
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2020, 28 (03) : 544 - 557