Unsupervised feature selection via maximum projection and minimum redundancy

被引:60
作者
Wang, Shiping [1 ,2 ]
Pedrycz, Witold [2 ,3 ]
Zhu, Qingxin [1 ]
Zhu, William [4 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[2] Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T6G 2G7, Canada
[3] Polish Acad Sci, Syst Res Inst, PL-01447 Warsaw, Poland
[4] Minnan Normal Univ, Lab Granular Comp, Zhangzhou 363000, Peoples R China
基金
中国国家自然科学基金;
关键词
Machine learning; Feature selection; Unsupervised learning; Matrix factorization; Kernel method; Minimum redundancy; MUTUAL INFORMATION;
D O I
10.1016/j.knosys.2014.11.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dimensionality reduction is an important and challenging task in machine learning and data mining. It can facilitate data clustering, classification and information retrieval. As an efficient technique for dimensionality reduction, feature selection is about finding a small feature subset preserving the most relevant information. In this paper, we propose a new criterion, called maximum projection and minimum redundancy feature selection, to address unsupervised learning scenarios. First, the feature selection is formalized with the use of the projection matrices and then characterized equivalently as a matrix factorization problem. Second, an iterative update algorithm and a greedy algorithm are proposed to tackle this problem. Third, kernel techniques are considered and the corresponding algorithm is also put forward. Finally, the proposed algorithms are compared with four state-of-the-art feature selection methods. Experimental results reported for six publicly datasets demonstrate the superiority of the proposed algorithms. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:19 / 29
页数:11
相关论文
共 29 条
[1]  
[Anonymous], 2006, FEATURE EXTRACTION F
[2]  
[Anonymous], 1997, ICML
[3]  
[Anonymous], 2001, Pattern Classification
[4]  
Cai D., 2010, P 16 ACM SIGKDD INT, P333
[5]  
Dy JG, 2004, J MACH LEARN RES, V5, P845
[6]   Efficient greedy feature selection for unsupervised learning [J].
Farahat, Ahmed K. ;
Ghodsi, Ali ;
Kamel, Mohamed S. .
KNOWLEDGE AND INFORMATION SYSTEMS, 2013, 35 (02) :285-310
[7]   Research on collaborative negotiation for e-commerce. [J].
Feng, YQ ;
Lei, Y ;
Li, Y ;
Cao, RZ .
2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, :2085-2088
[8]  
Gu Q., 2011, PROC 22TH C UNCERTAI, P266
[9]  
Guyon Isabelle, 2003, Journal of Machine Learning, V3, P1157
[10]  
He X., 2005, Adv. Neural Inf. Process. Syst., V18, P507