Binary patterns encoded convolutional neural networks for texture recognition and remote sensing scene classification

被引:230
作者
Anwer, Rao Muhammad [1 ]
Khan, Fahad Shahbaz [2 ]
van de Weijer, Joost [3 ]
Molinier, Matthieu [4 ]
Laaksonen, Jorma [1 ]
机构
[1] Aalto Univ, Sch Sci, Dept Comp Sci, Aalto, Finland
[2] Linkoping Univ, Comp Vis Lab, Linkoping, Sweden
[3] Univ Autonoma Barcelona, Comp Vis Ctr, CS Dept, Barcelona, Spain
[4] VTT Tech Res Ctr Finland Ltd, Remote Sensing Team, Espoo, Finland
基金
芬兰科学院;
关键词
Remote sensing; Deep learning; Scene classification; Local Binary Patterns; Texture analysis; EXTRACTION; FEATURES; MODEL; COLOR;
D O I
10.1016/j.isprsjprs.2018.01.023
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Designing discriminative powerful texture features robust to realistic imaging conditions is a challenging computer vision problem with many applications, including material recognition and analysis of satellite or aerial imagery. In the past, most texture description approaches were based on dense orderless statistical distribution of local features. However, most recent approaches to texture recognition and remote sensing scene classification are based on Convolutional Neural Networks (CNNs). The de facto practice when learning these CNN models is to use RGB patches as input with training performed on large amounts of labeled data (ImageNet). In this paper, we show that Local Binary Patterns (LBP) encoded CNN models, codenamed TEX-Nets, trained using mapped coded images with explicit LBP based texture information provide complementary information to the standard RGB deep models. Additionally, two deep architectures, namely early and late fusion, are investigated to combine the texture and color information. To the best of our knowledge, we are the first to investigate Binary Patterns encoded CNNs and different deep network fusion architectures for texture recognition and remote sensing scene classification. We perform comprehensive experiments on four texture recognition datasets and four remote sensing scene classification benchmarks: UC-Merced with 21 scene categories, WHU-RS19 with 19 scene classes, RSSCN7 with 7 categories and the recently introduced large scale aerial image dataset (AID) with 30 aerial scene types. We demonstrate that TEX-Nets provide complementary information to standard RGB deep model of the same network architecture. Our late fusion TEX-Net architecture always improves the overall performance compared to the standard RGB network on both recognition problems. Furthermore, our final combination leads to consistent improvement over the state-of-the-art for remote sensing scene classification. (C) 2018 International Society for Photogrammetry and Remote Sensing, Inc. (ISPRS). Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:74 / 85
页数:12
相关论文
共 93 条
[31]  
dos Santos J. A., 2010, VISAPP
[32]  
Eitel A, IROS
[33]   Noise tolerant local binary pattern operator for efficient texture analysis [J].
Fathi, Abdolhossein ;
Naghsh-Nilchi, Ahmad Reza .
PATTERN RECOGNITION LETTERS, 2012, 33 (09) :1093-1100
[34]   Convolutional Two-Stream Network Fusion for Video Action Recognition [J].
Feichtenhofer, Christoph ;
Pinz, Axel ;
Zisserman, Andrew .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1933-1941
[35]  
Fukui A., 2016, ARXIV160601847, P457, DOI DOI 10.18653/V1/D16-1044
[36]   High-Resolution SAR Image Classification via Deep Convolutional Autoencoders [J].
Geng, Jie ;
Fan, Jianchao ;
Wang, Hongyu ;
Ma, Xiaorui ;
Li, Baoming ;
Chen, Fuliang .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2015, 12 (11) :2351-2355
[37]  
GUO Z, 1971, TIP, V19, P1657, DOI DOI 10.1109/TIP.2010.2044957
[38]   A Completed Modeling of Local Binary Pattern Operator for Texture Classification [J].
Guo, Zhenhua ;
Zhang, Lei ;
Zhang, David .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2010, 19 (06) :1657-1663
[39]   Rotation invariant texture classification using LBP variance (LBPV) with global matching [J].
Guo, Zhenhua ;
Zhang, Lei ;
Zhang, David .
PATTERN RECOGNITION, 2010, 43 (03) :706-719
[40]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778