Discriminative Dictionary Pair Learning With Scale-Constrained Structured Representation for Image Classification

被引:10
作者
Chen, Zhe [1 ]
Wu, Xiao-Jun [1 ]
Xu, Tianyang [1 ]
Kittler, Josef [2 ]
机构
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Jiangsu, Peoples R China
[2] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, Surrey, England
基金
中国国家自然科学基金; 英国工程与自然科学研究理事会;
关键词
Dictionaries; Training; Representation learning; Feature extraction; Encoding; Learning systems; Data mining; Block-diagonal structure; image classification; scale-constrained discriminative representation learning; structured dictionary pair learning (DPL); within-class similarity preservation; FACE-RECOGNITION; SPARSE REPRESENTATION; K-SVD; REGRESSION; ALGORITHM;
D O I
10.1109/TNNLS.2022.3165217
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The dictionary pair learning (DPL) model aims to design a synthesis dictionary and an analysis dictionary to accomplish the goal of rapid sample encoding. In this article, we propose a novel structured representation learning algorithm based on the DPL for image classification. It is referred to as discriminative DPL with scale-constrained structured representation (DPL-SCSR). The proposed DPL-SCSR utilizes the binary label matrix of dictionary atoms to project the representation into the corresponding label space of the training samples. By imposing a non-negative constraint, the learned representation adaptively approximates a block-diagonal structure. This innovative transformation is also capable of controlling the scale of the block-diagonal representation by enforcing the sum of within-class coefficients of each sample to 1, which means that the dictionary atoms of each class compete to represent the samples from the same class. This implies that the requirement of similarity preservation is considered from the perspective of the constraint on the sum of coefficients. More importantly, the DPL-SCSR does not need to design a classifier in the representation space as the label matrix of the dictionary can also be used as an efficient linear classifier. Finally, the DPL-SCSR imposes the l(2,p)-norm on the analysis dictionary to make the process of feature extraction more interpretable. The DPL-SCSR seamlessly incorporates the scale-constrained structured representation learning, within-class similarity preservation of representation, and the linear classifier into one regularization term, which dramatically reduces the complexity of training and parameter tuning. The experimental results on several popular image classification datasets show that our DPL-SCSR can deliver superior performance compared with the state-of-the-art (SOTA) dictionary learning methods. The MATLAB code of this article is available at https://github.com/chenzhe207/DPL-SCSR.
引用
收藏
页码:10225 / 10239
页数:15
相关论文
共 52 条
[1]   K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation [J].
Aharon, Michal ;
Elad, Michael ;
Bruckstein, Alfred .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (11) :4311-4322
[2]  
[Anonymous], 2010, P ADV NEUR INF PROC, DOI DOI 10.1007/978-3-319-10690-8_12
[3]   Evaluating Open-Universe Face Identification on the Web [J].
Becker, Brian C. ;
Ortiz, Enrique G. .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2013, :904-911
[4]   Distributed optimization and statistical learning via the alternating direction method of multipliers [J].
Boyd S. ;
Parikh N. ;
Chu E. ;
Peleato B. ;
Eckstein J. .
Foundations and Trends in Machine Learning, 2010, 3 (01) :1-122
[5]   Learning Alternating Deep-Layer Cascaded Representation [J].
Chen, Zhe ;
Wu, Xiao-Jun ;
Xu, Tianyang ;
Kittler, Josef .
IEEE SIGNAL PROCESSING LETTERS, 2021, 28 :1520-1524
[6]   Noise-robust dictionary learning with slack block-Diagonal structure for face recognition [J].
Chen, Zhe ;
Wu, Xiao-Jun ;
Yin, He-Feng ;
Kittler, Josef .
PATTERN RECOGNITION, 2020, 100
[7]   A sparse regularized nuclear norm based matrix regression for face recognition with contiguous occlusion [J].
Chen, Zhe ;
Wu, Xiao-Jun ;
Kittler, Josef .
PATTERN RECOGNITION LETTERS, 2019, 125 :494-499
[8]  
Dong WS, 2011, PROC CVPR IEEE, P457, DOI 10.1109/CVPR.2011.5995478
[9]   Approximate Low-Rank Projection Learning for Feature Extraction [J].
Fang, Xiaozhao ;
Han, Na ;
Wu, Jigang ;
Xu, Yong ;
Yang, Jian ;
Wong, Wai Keung ;
Li, Xuelong .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (11) :5228-5241
[10]   Learning robust and discriminative low-rank representations for face recognition with occlusion [J].
Gao, Guangwei ;
Yang, Jian ;
Jing, Xiao-Yuan ;
Shen, Fumin ;
Yang, Wankou ;
Yue, Dong .
PATTERN RECOGNITION, 2017, 66 :129-143