Joint dictionary and graph learning for unsupervised feature selection

被引:8
作者
Ding, Dediong [1 ]
Xia, Fei [2 ,3 ]
Yang, Xiaogao [4 ]
Tang, Chang [5 ]
机构
[1] Southwestern Univ Finance & Econ, Sch Stat, Chengdu 611130, Peoples R China
[2] Naval Univ Engn, Changsha 410073, Peoples R China
[3] NUDT, Opt Engn Postdoctoral Mobile Stn, Changsha 410073, Peoples R China
[4] Hunan Univ Arts & Sci, Coll Mech Engn, Changde 415000, Peoples R China
[5] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Unsupervised feature selection; Dictionary learning; Similarity graph learning; Local structure preservation; CLASSIFICATION; ALGORITHM;
D O I
10.1007/s10489-019-01561-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the explosion of unlabelled and high-dimensional data, unsupervised feature selection has become an critical and challenging problem in machine learning. Recently, data representation based model has been successfully deployed for unsupervised feature selection, which defines feature importance as the capability to represent original data via a reconstruction function. However, most existing algorithms conduct feature selection on original feature space, which will be affected by the noisy and redundant features of original feature space. In this paper, we investigate how to conduct feature selection on the dictionary basis space of the data, which can capture higher level and more abstract representation than original low-level representation. In addition, a similarity graph is learned simultaneously to preserve the local geometrical data structure which has been confirmed critical for unsupervised feature selection. In summary, we propose a model (referred to as DGL-UFS briefly) to integrate dictionary learning, similarity graph learning and feature selection into a uniform framework. Experiments on various types of real world datasets demonstrate the effectiveness of the proposed framework DGL-UFS.
引用
收藏
页码:1379 / 1397
页数:19
相关论文
共 56 条
[1]  
[Anonymous], 2011, COLUMBIA OBJECT IMAG
[2]  
[Anonymous], 2010, P 16 ACM SIGKDD INT, DOI [10.1145/1835804.1835848, DOI 10.1145/1835804.1835848]
[3]   ALGORITHM - SOLUTION OF MATRIX EQUATION AX+XB = C [J].
BARTELS, RH ;
STEWART, GW .
COMMUNICATIONS OF THE ACM, 1972, 15 (09) :820-&
[4]  
Boyd S., 2010, Found. Trends Mach. Learn, V1, P122, DOI DOI 10.1561/2200000016
[5]   A dimension reduction algorithm preserving both global and local clustering structure [J].
Cai, Weiling .
KNOWLEDGE-BASED SYSTEMS, 2017, 118 :191-203
[6]  
Dietterich T. G., 1991, AAAI-91. Proceedings Ninth National Conference on Artificial Intelligence, P572
[7]   Unsupervised feature selection applied to content-based retrieval of lung images [J].
Dy, JG ;
Brodley, CE ;
Kak, A ;
Broderick, LS ;
Aisen, AM .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (03) :373-378
[8]   Unsupervised feature selection based on the Morisita estimator of intrinsic dimension [J].
Golay, Jean ;
Kanevski, Mikhail .
KNOWLEDGE-BASED SYSTEMS, 2017, 135 :125-134
[9]  
Guyon Isabelle, 2003, J MACH LEARN RES, V3, P1157, DOI DOI 10.1162/153244303322753616
[10]  
Hall MA., 2000, P 17 INT C MACH LEAR, P359, DOI DOI 10.1016/J.ESWA.2015.02.049