A conditionally positive definite kernel function for clustering of incomplete data

被引:0
|
作者
Goel, Sonia [1 ]
Tushir, Meena [1 ]
机构
[1] Guru Gobind Singh Indraprastha Univ, Maharaja Surajmal Inst Technol, Dept Elect & Elect Engn, New Delhi, India
来源
JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES | 2024年 / 45卷 / 02期
关键词
Clustering; Incomplete data; Imputation; Non-imputation techniques; Kernel function; Positive definite & conditionally positive definite kernel function; IMPUTATION;
D O I
10.47974/JIOS-1557
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Clustering of incomplete data sets that contains missing features is one of the most widely studied problems in the literature, and several imputation and non-imputation techniques are used to solve this problem. A weighted sum of the Euclidean distance from the datum to the corresponding clusters is used in Fuzzy c-means clustering. It has been observed that the kernel-based clustering techniques outperform the conventional algorithms in terms of accuracy. This is due to their ability to handle non-linear data and map it to higher dimensional space while preserving its internal structure. Kernel functions are really important when it comes to the performance of kernel-based clustering methods. Choosing the right kernel function isn't simple. Among the various clustering algorithms that have been examined in the literature, the, Gaussian kernel function has been found to be more useful.. This paper suggests a conditionally positive definite kernel function that can be used in the unsupervised clustering of incomplete data. Numerical analysis shows that the conditionally positive definite kernel function also performs well on datasets with incomplete features.
引用
收藏
页码:403 / 412
页数:10
相关论文
共 50 条
  • [31] Affinity Propagation Clustering with Incomplete Data
    Lu, Cheng
    Song, Shiji
    Wu, Cheng
    COMPUTATIONAL INTELLIGENCE, NETWORKED SYSTEMS AND THEIR APPLICATIONS, 2014, 462 : 239 - 248
  • [32] Bioinspired Hybrid and Incomplete Data Clustering
    Tusell-Rey, Claudia C.
    Villuendas-Rey, Yenny
    Camacho-Nieto, Oscar
    Salinas-Garcia, Viridiana
    INTERNATIONAL JOURNAL OF COMBINATORIAL OPTIMIZATION PROBLEMS AND INFORMATICS, 2024, 15 (04): : 85 - 100
  • [33] Riemannian competitive learning for symmetric positive definite matrices clustering
    Zheng, Ligang
    Qiu, Guoping
    Huang, Jiwu
    NEUROCOMPUTING, 2018, 295 : 153 - 164
  • [34] Kernel regression estimation for incomplete data with applications
    Mojirsheibani, Majid
    Reese, Timothy
    STATISTICAL PAPERS, 2017, 58 (01) : 185 - 209
  • [35] Kernel regression estimation for incomplete data with applications
    Majid Mojirsheibani
    Timothy Reese
    Statistical Papers, 2017, 58 : 185 - 209
  • [36] CLINCH: Clustering incomplete high-dimensional data for data mining application
    Cheng, ZP
    Zhou, D
    Wang, C
    Guo, JK
    Wang, W
    Ding, BK
    Shi, B
    WEB TECHNOLOGIES RESEARCH AND DEVELOPMENT - APWEB 2005, 2005, 3399 : 88 - 99
  • [37] A Clustering Algorithm via Kernel Function and Locality Preserving Projections
    Zhan, Mengmeng
    Lu, Guangquan
    Wen, Guoqiu
    Zhang, Leyuan
    Wu, Lin
    2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 2620 - 2625
  • [38] A Three-Way Decisions Clustering Algorithm for Incomplete Data
    Yu, Hong
    Su, Ting
    Zeng, Xianhua
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY, RSKT 2014, 2014, 8818 : 765 - 776
  • [39] Fuzzy Clustering Algorithm of Kernel for Gene Expression Data Analysis
    Liu, Wenyuan
    Zhang, Bin
    2009 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION SYSTEMS AND APPLICATIONS, PROCEEDINGS, 2009, : 553 - 556
  • [40] Model-Based Clustering for Conditionally Correlated Categorical Data
    Matthieu Marbac
    Christophe Biernacki
    Vincent Vandewalle
    Journal of Classification, 2015, 32 : 145 - 175