Robust Image Analysis With Sparse Representation on Quantized Visual Features

被引:39
作者
Bao, Bing-Kun [1 ,2 ]
Zhu, Guangyu [3 ]
Shen, Jialie [4 ]
Yan, Shuicheng [5 ]
机构
[1] Chinese Acad Sci, Natl Lab Pattern Recognit, Inst Automat, Beijing 100190, Peoples R China
[2] China Singapore Inst Digital Media, Singapore 119613, Singapore
[3] Univ Calif Berkeley, Berkeley, CA 94720 USA
[4] Singapore Management Univ, Singapore 188065, Singapore
[5] Natl Univ Singapore, Singapore 117576, Singapore
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Image classification; quantized visual feature; sparse representation; SHRINKAGE; ALGORITHM; SELECTION; SCALE;
D O I
10.1109/TIP.2012.2219543
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent techniques based on sparse representation (SR) have demonstrated promising performance in high-level visual recognition, exemplified by the highly accurate face recognition under occlusion and other sparse corruptions. Most research in this area has focused on classification algorithms using raw image pixels, and very few have been proposed to utilize the quantized visual features, such as the popular bag-of-words feature abstraction. In such cases, besides the inherent quantization errors, ambiguity associated with visual word assignment and misdetection of feature points, due to factors such as visual occlusions and noises, constitutes the major cause of dense corruptions of the quantized representation. The dense corruptions can jeopardize the decision process by distorting the patterns of the sparse reconstruction coefficients. In this paper, we aim to eliminate the corruptions and achieve robust image analysis with SR. Toward this goal, we introduce two transfer processes (ambiguity transfer and mis-detection transfer) to account for the two major sources of corruption as discussed. By reasonably assuming the rarity of the two kinds of distortion processes, we augment the original SR-based reconstruction objective with l(0)-norm regularization on the transfer terms to encourage sparsity and, hence, discourage dense distortion/transfer. Computationally, we relax the nonconvex l(0)-norm optimization into a convex l(1)-norm optimization problem, and employ the accelerated proximal gradient method to optimize the convergence provable updating procedure. Extensive experiments on four benchmark datasets, Caltech-101, Caltech-256, Corel-5k, and CMU pose, illumination, and expression, manifest the necessity of removing the quantization corruptions and the various advantages of the proposed framework.
引用
收藏
页码:860 / 871
页数:12
相关论文
共 42 条
  • [1] Face description with local binary patterns:: Application to face recognition
    Ahonen, Timo
    Hadid, Abdenour
    Pietikainen, Matti
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (12) : 2037 - 2041
  • [2] On the approximability of minimizing nonzero variables or unsatisfied relations in linear systems
    Amaldi, E
    Kann, V
    [J]. THEORETICAL COMPUTER SCIENCE, 1998, 209 (1-2) : 237 - 260
  • [3] [Anonymous], 2007, ACM MULTIMEDIA, DOI DOI 10.1145/1291233.1291379
  • [4] Bao BK, 2009, P INT C INT MULT COM, P17
  • [5] A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems
    Beck, Amir
    Teboulle, Marc
    [J]. SIAM JOURNAL ON IMAGING SCIENCES, 2009, 2 (01): : 183 - 202
  • [6] NESTA: A Fast and Accurate First-Order Method for Sparse Recovery
    Becker, Stephen
    Bobin, Jerome
    Candes, Emmanuel J.
    [J]. SIAM JOURNAL ON IMAGING SCIENCES, 2011, 4 (01): : 1 - 39
  • [7] Boyd S.P, 2004, Convex optimization, DOI [DOI 10.1017/CBO9780511804441, 10.1017/CBO9780511804441]
  • [8] From Sparse Solutions of Systems of Equations to Sparse Modeling of Signals and Images
    Bruckstein, Alfred M.
    Donoho, David L.
    Elad, Michael
    [J]. SIAM REVIEW, 2009, 51 (01) : 34 - 81
  • [9] Near-optimal signal recovery from random projections: Universal encoding strategies?
    Candes, Emmanuel J.
    Tao, Terence
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2006, 52 (12) : 5406 - 5425
  • [10] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893