Relaxed Block-Diagonal Dictionary Pair Learning With Locality Constraint for Image Recognition

被引:25
作者
Chen, Zhe [1 ]
Wu, Xiao-Jun [1 ]
Kittler, Josef [2 ]
机构
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Jiangsu, Peoples R China
[2] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, Surrey, England
基金
英国工程与自然科学研究理事会;
关键词
Discriminative dictionary pair learning; image recognition; locality constraint; relaxed block-diagonal (RBD) representation; FACE-RECOGNITION; SPARSE; PROJECTION; REPRESENTATION; ALGORITHM;
D O I
10.1109/TNNLS.2021.3053941
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel structured analysis-synthesis dictionary pair learning method for efficient representation and image classification, referred to as relaxed block-diagonal dictionary pair learning with a locality constraint (RBD-DPL). RBD-DPL aims to learn relaxed block-diagonal representations of the input data to enhance the discriminability of both analysis and synthesis dictionaries by dynamically optimizing the block-diagonal components of representation, while the off-block-diagonal counterparts are set to zero. In this way, the learned synthesis subdictionary is allowed to be more flexible in reconstructing the samples from the same class, and the analysis dictionary effectively transforms the original samples into a relaxed coefficient subspace, which is closely associated with the label information. Besides, we incorporate a locality-constraint term as a complement of the relaxation learning to enhance the locality of the analytical encoding so that the learned representation exhibits high intraclass similarity. A linear classifier is trained in the learned relaxed representation space for consistent classification. RBD-DPL is computationally efficient because it avoids both the use of class-specific complementary data matrices to learn discriminative analysis dictionary, as well as the time-consuming l(1)/l(0)-norm sparse reconstruction process. The experimental results demonstrate that our RBD-DPL achieves at least comparable or better recognition performance than the state-of-the-art algorithms. Moreover, both the training and testing time are significantly reduced, which verifies the efficiency of our method. The MATLAB code of the proposed RBD-DPL is available at https://github.com/chenzhe207/RBD-DPL.
引用
收藏
页码:3645 / 3659
页数:15
相关论文
共 61 条
[21]   Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories [J].
Li Fei-Fei ;
Fergus, Rob ;
Perona, Pietro .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2007, 106 (01) :59-70
[22]   DenseFuse: A Fusion Approach to Infrared and Visible Images [J].
Li, Hui ;
Wu, Xiao-Jun .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (05) :2614-2623
[23]   Discriminative Fisher Embedding Dictionary Learning Algorithm for Object Recognition [J].
Li, Zhengming ;
Zhang, Zheng ;
Qin, Jie ;
Zhang, Zhao ;
Shao, Ling .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (03) :786-800
[24]   Low-rank analysis-synthesis dictionary learning with adaptively ordinal locality [J].
Li, Zhengming ;
Zhang, Zheng ;
Qin, Jie ;
Li, Sheng ;
Cai, Hongmin .
NEURAL NETWORKS, 2019, 119 :93-112
[25]   A Locality-Constrained and Label Embedding Dictionary Learning Algorithm for Image Classification [J].
Li, Zhengming ;
Lai, Zhihui ;
Xu, Yong ;
Yang, Jian ;
Zhang, David .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (02) :278-293
[26]  
Lin Z., 2010, arXiv
[27]   Robust Recovery of Subspace Structures by Low-Rank Representation [J].
Liu, Guangcan ;
Lin, Zhouchen ;
Yan, Shuicheng ;
Sun, Ju ;
Yu, Yong ;
Ma, Yi .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (01) :171-184
[28]  
Martinez A.M., 1998, CVC Technical Report 24
[29]   Automated flower classification over a large number of classes [J].
Nilsback, Maria-Elena ;
Zisserman, Andrew .
SIXTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS & IMAGE PROCESSING ICVGIP 2008, 2008, :722-729
[30]  
Pham DS, 2008, PROC CVPR IEEE, P517