Multi-Scale Boosting Feature Encoding Network for Texture Recognition

被引:9
作者
Song, Kaiyou [1 ]
Yang, Hua [1 ]
Yin, Zhouping [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Mech Sci & Engn, State Key Lab Digital Mfg Equipment & Technol, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Encoding; Interference; Image coding; Boosting; Image recognition; Task analysis; Texture recognition; feature encoding; boosting; multi-scale; convolutional neural network; IMAGE; SCALE;
D O I
10.1109/TCSVT.2021.3051003
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Texture recognition remains a challenging visual task due to the complex appearance variations caused by scale changes in the real world. In most existing texture recognition methods, textures are represented at a single scale; thus, multi-scale texture information is not fully utilized, resulting in insufficient representation and inaccurate recognition. In this study, with the goal of addressing the challenge of scale changes, we propose a novel multi-scale boosting feature encoding network (MSBFEN) for accurate texture recognition. MSBFEN first extracts multi-scale features with multi-scale texture structure information under the guidance of texture priors using a novel prior-guided feature extraction (PFE) method. Then, a multi-scale texture encoding (MSTE) method is devised to capture discriminative multi-scale texture representations by encoding the extracted features. Finally, to fully utilize the multi-scale texture representations for accurate texture recognition, a novel multi-scale boosting learning (MSBL) method is proposed. In MSBL, the learning procedure for multi-scale texture recognition is boosted in a hierarchical, progressively reinforced manner, significantly addressing the challenge of scale changes and greatly enhancing the recognition accuracy. In addition, a novel outlier-aware texture encoding (OTE) method is proposed for robust texture encoding at each scale of MSTE. OTE can resist the influence of background interference and can further enhance the robustness of MSBFEN. In extensive experiments conducted on six challenging texture recognition datasets, namely, KTH-TIPS2b, FMD, DTD, MINC, GTOS and GTOS-mobile, MSBFEN achieves accuracies of 86.2%, 86.4%, 77.8%, 85.3%, 86.4% and 87.57%, respectively, representing state-of-the-art texture recognition performance.
引用
收藏
页码:4269 / 4282
页数:14
相关论文
共 55 条
[1]   Face description with local binary patterns:: Application to face recognition [J].
Ahonen, Timo ;
Hadid, Abdenour ;
Pietikainen, Matti .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (12) :2037-2041
[2]  
Ahonen Timo., 2007, Proceedings of the Finnish Signal Processing Symposium, FINSIG, P1
[3]   Distribution Cognisant Loss for Cross-Database Facial Age Estimation With Sensitivity Analysis [J].
Akbari, Ali ;
Awais, Muhammad ;
Feng, Zhenhua ;
Farooq, Ammarah ;
Kittler, Josef .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (04) :1869-1887
[4]   Using filter banks in Convolutional Neural Networks for texture classification [J].
Andrearczyk, Vincent ;
Whelan, Paulf. .
PATTERN RECOGNITION LETTERS, 2016, 84 :63-69
[5]  
Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/TPAMI.2017.2711011, 10.1109/CVPR.2016.572]
[6]  
Bell S, 2015, PROC CVPR IEEE, P3479, DOI 10.1109/CVPR.2015.7298970
[7]   Deep convolutional network with locality and sparsity constraints for texture classification [J].
Bu, Xingyuan ;
Wu, Yuwei ;
Gao, Zhi ;
Jia, Yunde .
PATTERN RECOGNITION, 2019, 91 :34-46
[8]   PCANet: A Simple Deep Learning Baseline for Image Classification? [J].
Chan, Tsung-Han ;
Jia, Kui ;
Gao, Shenghua ;
Lu, Jiwen ;
Zeng, Zinan ;
Ma, Yi .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (12) :5017-5032
[9]   Deep Filter Banks for Texture Recognition, Description, and Segmentation [J].
Cimpoi, Mircea ;
Maji, Subhransu ;
Kokkinos, Iasonas ;
Vedaldi, Andrea .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 118 (01) :65-94
[10]   Describing Textures in the Wild [J].
Cimpoi, Mircea ;
Maji, Subhransu ;
Kokkinos, Iasonas ;
Mohamed, Sammy ;
Vedaldi, Andrea .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3606-3613