A dimension reduction algorithm preserving both global and local clustering structure

被引:23
作者
Cai, Weiling [1 ]
机构
[1] Nanjing Normal Univ, Dept Comp Sci & Technol, Nanjing 210097, Jiangsu, Peoples R China
基金
美国国家科学基金会;
关键词
Pattern recognition; Dimension reduction; Clustering learning; Global structure preserving; Local structure preserving; LINEAR DISCRIMINANT-ANALYSIS; SUPPORT VECTOR MACHINES; RECOGNITION; COMBINATION;
D O I
10.1016/j.knosys.2016.11.020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
By combining linear discriminant analysis and Kmeans into a coherent framework, a dimension reduction algorithm was recently proposed to select the most discriminative subspace. This algorithm utilized the clustering method to generate cluster labels and after that employed discriminant analysis to do subspace selection. However, we found that this algorithm only considers the information of global structure, and does not take into account the information of local structure. In order to overcome the shortcoming mentioned above, this paper presents a dimension reduction algorithm preserving both global and local clustering structure. Our algorithm is an unsupervised linear dimension reduction algorithm suitable for the data with cloud distribution. In the proposed algorithm, the Kmeans clustering method is adopted to generate the clustering labels for all data in the original, space. And then, the obtained clustering labels are utilized to describe the global and local clustering structure. Finally, the objective function is established to preserve both the local and global clustering structure. By solving this objective function, the projection matrix and the corresponding subspace are yielded. In this way, the global and local information of the clustering structure are integrated into the process of the subspace selection, in fact, the structure discovery and the subspace selection are performed simultaneously in our algorithm. Encouraging experimental results are achieved on the artificial dataset, real-life benchmark dataset and AR face dataset. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:191 / 203
页数:13
相关论文
共 38 条
[1]  
[Anonymous], 2002, Principal components analysis
[2]  
[Anonymous], IEEE T ACOUSTICS SPE
[3]   Multiobjective clustering analysis using particle swarm optimization [J].
Armano, Giuliano ;
Farmani, Mohammad Reza .
EXPERT SYSTEMS WITH APPLICATIONS, 2016, 55 :184-193
[4]   A Comparative Review of Dimension Reduction Methods in Approximate Bayesian Computation [J].
Blum, M. G. B. ;
Nunes, M. A. ;
Prangle, D. ;
Sisson, S. A. .
STATISTICAL SCIENCE, 2013, 28 (02) :189-208
[5]   Randomized Dimensionality Reduction for k-Means Clustering [J].
Boutsidis, Christos ;
Zouzias, Anastasios ;
Mahoney, Michael W. ;
Drineas, Petros .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2015, 61 (02) :1045-1062
[6]  
Bühlmann P, 2011, SPRINGER SER STAT, P1, DOI 10.1007/978-3-642-20192-9
[7]   A comparative study of efficient initialization methods for the k-means clustering algorithm [J].
Celebi, M. Emre ;
Kingravi, Hassan A. ;
Vela, Patricio A. .
EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (01) :200-210
[8]   Feature extraction using fuzzy maximum margin criterion [J].
Cui, Yan ;
Fan, Liya .
NEUROCOMPUTING, 2012, 86 :52-58
[9]   Combining extreme learning machines using support vector machines for breast tissue classification [J].
Daliri, Mohammad Reza .
COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING, 2015, 18 (02) :185-191
[10]   Feature selection using binary particle swarm optimization and support vector machines for medical diagnosis [J].
Daliri, Mohammad Reza .
BIOMEDICAL ENGINEERING-BIOMEDIZINISCHE TECHNIK, 2012, 57 (05) :395-402