Beyond sparsity: The role of L1-optimizer in pattern classification

被引:176
作者
Yang, Jian [1 ,2 ]
Zhang, Lei [3 ,5 ]
Xu, Yong [4 ]
Yang, Jing-yu [1 ]
机构
[1] Nanjing Univ Sci & Technol, Dept Comp Sci, Nanjing 210094, Jiangsu, Peoples R China
[2] CALTECH, Pasadena, CA 91125 USA
[3] Hong Kong Polytech Univ, Biometr Res Ctr, Dept Comp, Kowloon, Hong Kong, Peoples R China
[4] Harbin Inst Technol, Biocomp Res Ctr, Shenzhen Grad Sch, Shenzhen, Peoples R China
[5] McMaster Univ, Dept Elect & Comp Engn, Hamilton, ON L8S 4L8, Canada
基金
美国国家科学基金会;
关键词
Sparse representation; Pattern classification; Classifier; Feature extraction; FACE RECOGNITION; SIGNAL RECOVERY; ILLUMINATION; PROJECTIONS; IMAGE;
D O I
10.1016/j.patcog.2011.08.022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The newly-emerging sparse representation-based classifier (SRC) shows great potential for pattern classification but lacks theoretical justification. This paper gives an insight into SRC and seeks reasonable supports for its effectiveness. SRC uses L-1-optimizer instead of L-0-optimizer on account of computational convenience and efficiency. We re-examine the role of L-1-optimizer and find that for pattern recognition tasks, L-1-optimizer provides more classification meaningful information than L-0-optimizer does. L-0-optimizer can achieve sparsity only, whereas L-1-optimizer can achieve closeness as well as sparsity. Sparsity determines a small number of nonzero representation coefficients, while closeness makes the nonzero representation coefficients concentrate on the training samples with the same class label as the given test sample. Thus, it is closeness that guarantees the effectiveness of the L-1-optimizer based SRC. Based on the closeness prior, we further propose two kinds of class L-1-optimizer classifiers (CL1C), the closeness rule based CL1C (C-CL1C) and its improved version: the Lasso rule based CL1C (L-CL1C). The proposed classifiers are evaluated on five databases and the experimental results demonstrate advantages of the proposed classifiers over SRC in classification performance and computational efficiency for large sample size problems. (C) 2011 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1104 / 1118
页数:15
相关论文
共 53 条
[1]   K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation [J].
Aharon, Michal ;
Elad, Michael ;
Bruckstein, Alfred .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (11) :4311-4322
[2]   On the approximability of minimizing nonzero variables or unsatisfied relations in linear systems [J].
Amaldi, E ;
Kann, V .
THEORETICAL COMPUTER SCIENCE, 1998, 209 (1-2) :237-260
[3]  
[Anonymous], 2008, P IEEE C COMP VIS PA
[4]  
[Anonymous], 2009, IEEE C COMP VIS PATT
[5]  
[Anonymous], 2008, IEEE C COMP VIS PATT
[6]  
[Anonymous], 2009, P SIAM INT C DAT MIN
[7]  
[Anonymous], NEURAL INFORM PROCES
[8]  
[Anonymous], P IEEE
[9]  
[Anonymous], THESIS MIT
[10]   Eigenfaces vs. Fisherfaces: Recognition using class specific linear projection [J].
Belhumeur, PN ;
Hespanha, JP ;
Kriegman, DJ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (07) :711-720