Robust graph regularized unsupervised feature selection

被引:70
作者
Tang, Chang [1 ]
Zhu, Xinzhong [2 ]
Chen, Jiajia [3 ]
Wang, Pichao [4 ]
Liu, Xinwang [5 ]
Tian, Jie [2 ]
机构
[1] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Hubei, Peoples R China
[2] Xidian Univ, Sch Life Sci & Technol, Xian 710071, Shanxi, Peoples R China
[3] Xuzhou Med Coll, Huaian Peoples Hosp 2, Dept Pharm, Huaian 223002, Peoples R China
[4] Univ Wollongong, Sch Comp & Informat Technol, Wollongong, NSW 2522, Australia
[5] Natl Univ Def Technol, Sch Comp Sci, Changsha 410073, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Unsupervised feature selection; Local geometric structure; Graph regularization; Similarity preservation;
D O I
10.1016/j.eswa.2017.11.053
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent research indicates the critical importance of preserving local geometric structure of data in unsupervised feature selection (UFS), and the well studied graph Laplacian is usually deployed to capture this property. By using a squared l(2)-norm, we observe that conventional graph Laplacian is sensitive to noisy data, leading to unsatisfying data processing performance. To address this issue, we propose a unified UFS framework via feature self-representation and robust graph regularization, with the aim at reducing the sensitivity to outliers from the following two aspects: i) an l(2), (1)-norm is used to characterize the feature representation residual matrix; and ii) an l(1)-norm based graph Laplacian regularization term is adopted to preserve the local geometric structure of data. By this way, the proposed framework is able to reduce the effect of noisy data on feature selection. Furthermore, the proposed l(1)-norm based graph Laplacian is readily extendible, which can be easily integrated into other UFS methods and machine learning tasks with local geometrical structure of data being preserved. As demonstrated on ten challenging benchmark data sets, our algorithm significantly and consistently outperforms state-of-the-art UFS methods in the literature, suggesting the effectiveness of the proposed UFS framework. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:64 / 76
页数:13
相关论文
共 44 条
[1]  
[Anonymous], FOUND TRENDS MACH LE
[2]   A non-local algorithm for image denoising [J].
Buades, A ;
Coll, B ;
Morel, JM .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2005, :60-65
[3]  
Cai D, 2010, P 16 ACM SIGKDD INT, P333, DOI DOI 10.1145/1835804.1835848
[4]   Local Coordinates Alignment With Global Preservation for Dimensionality Reduction [J].
Chen, Jing ;
Ma, Zhengming ;
Liu, Yang .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (01) :106-117
[5]   Unsupervised probabilistic feature selection using ant colony optimization [J].
Dadaneh, Behrouz Zamani ;
Markid, Hossein Yeganeh ;
Zakerolhosseini, Ali .
EXPERT SYSTEMS WITH APPLICATIONS, 2016, 53 :27-42
[6]  
Dietterich T. G., 1991, AAAI-91. Proceedings Ninth National Conference on Artificial Intelligence, P572
[7]   Unsupervised feature selection applied to content-based retrieval of lung images [J].
Dy, JG ;
Brodley, CE ;
Kak, A ;
Broderick, LS ;
Aisen, AM .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (03) :373-378
[8]   Leonardo's Rule, Self-Similarity, and Wind-Induced Stresses in Trees [J].
Eloy, Christophe .
PHYSICAL REVIEW LETTERS, 2011, 107 (25)
[9]  
Freedman G, 2011, ACM T GRAPHIC, V30, P474
[10]  
Gu Q., 2011, Proceedings of the Twenty-Second international joint conference on Artificial Intelligence, V2, P1294