A conditionally positive definite kernel function for clustering of incomplete data

Cited by: 0

Authors:

Goel, Sonia [1]

Tushir, Meena [1]

Affiliations:

[1] Guru Gobind Singh Indraprastha Univ, Maharaja Surajmal Inst Technol, Dept Elect & Elect Engn, New Delhi, India

Source:

JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES | 2024, Vol. 45, No. 2

Keywords:

Clustering; Incomplete data; Imputation; Non-imputation techniques; Kernel function; Positive definite & conditionally positive definite kernel function

DOI:

10.47974/JIOS-1557

Chinese Library Classification (CLC) number:

G25 [Library science, librarianship]; G35 [Information science, information work]

Subject classification code:

1205; 120501

Abstract:

Clustering incomplete data sets, in which some feature values are missing, is one of the most widely studied problems in the literature, and several imputation and non-imputation techniques have been used to solve it. Fuzzy c-means clustering uses a weighted sum of the Euclidean distances from each datum to the corresponding clusters. Kernel-based clustering techniques have been observed to outperform the conventional algorithms in terms of accuracy, owing to their ability to handle non-linear data by mapping it to a higher-dimensional space while preserving its internal structure. The choice of kernel function strongly affects the performance of kernel-based clustering methods, and choosing the right one is not simple. Among the clustering algorithms examined in the literature, the Gaussian kernel function has been found to be more useful. This paper proposes a conditionally positive definite kernel function that can be used in the unsupervised clustering of incomplete data. Numerical analysis shows that the conditionally positive definite kernel function also performs well on datasets with incomplete features.
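To make the role of the kernel concrete, the following is a minimal Python sketch, not the method of the paper. The abstract does not say which conditionally positive definite kernel is proposed, so the sketch substitutes the textbook power kernel k(x, v) = -||x - v||^beta (conditionally positive definite for 0 < beta <= 2) and compares it with the Gaussian kernel. Missing features are marked with NaN and handled with the partial distance strategy, one common non-imputation technique, which is also an assumption here. All function names and parameters (`partial_sq_distance`, `sigma`, `beta`, and so on) are illustrative.

```python
import numpy as np


def partial_sq_distance(x, v):
    """Squared Euclidean distance over the observed features of x only.

    Missing features in x are NaN. The sum is rescaled by
    (total features / observed features), as in the partial distance strategy.
    """
    observed = ~np.isnan(x)
    n_obs = int(observed.sum())
    if n_obs == 0:
        raise ValueError("datum has no observed features")
    diff = x[observed] - v[observed]
    return (x.size / n_obs) * float(diff @ diff)


def gaussian_kernel(sq_dist, sigma=1.0):
    """Gaussian kernel written as a function of the squared distance."""
    return np.exp(-sq_dist / (2.0 * sigma ** 2))


def power_kernel(sq_dist, beta=1.0):
    """Assumed example of a conditionally positive definite kernel:
    k(x, v) = -||x - v||**beta with 0 < beta <= 2."""
    return -(sq_dist ** (beta / 2.0))


def kernel_distance(kernel, x, v, **params):
    """Kernel-induced squared distance k(x,x) - 2 k(x,v) + k(v,v).

    Both kernels above depend only on the distance, so k(x,x) = k(v,v)
    is the kernel evaluated at distance zero.
    """
    k_self = kernel(0.0, **params)
    k_cross = kernel(partial_sq_distance(x, v), **params)
    return 2.0 * (k_self - k_cross)


# One datum with a missing second feature, and one cluster center.
x = np.array([1.0, np.nan, 3.0])
v = np.array([0.0, 5.0, 1.0])

d_gauss = kernel_distance(gaussian_kernel, x, v, sigma=1.0)
d_power = kernel_distance(power_kernel, x, v, beta=2.0)
```

In a kernel fuzzy c-means loop, a distance of this kind would replace the Euclidean distance in the weighted sum over clusters. Whether conditional positive definiteness carries over once distances are computed on partially observed vectors is not something this sketch establishes.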


Pages: 403-412

Page count: 10
