Large Kernel Separable Mixed ConvNet for Remote Sensing Scene Classification

被引：3

作者：

Zhang, Keqian ^{[1
]}

Cui, Tengfei ^{[2
]}

Wu, Wei ^{[3
]}

Zheng, Xueke ^{[1
]}

Cheng, Gang ^{[1
]}

机构：

[1] Henan Polytech Univ, Coll Surveying & Land Informat Engn, Jiaozuo 454000, Peoples R China

[2] Boston Univ, Metropolitan Coll, Boston, MA 02215 USA

[3] Guilin Univ Technol, Coll Geomat & Geoinformat, Guilin 541004, Peoples R China

来源：

IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING | 2024年 / 17卷

关键词：

Channel separation and mixing; large kernel convolution; remote sensing; scene classification; IMAGERY; OBJECT; RECOGNITION; NETWORKS; FEATURES; BAG;

D O I：

10.1109/JSTARS.2024.3353796

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Among tasks related to intelligent interpretation of remote sensing data, scene classification mainly focuses on the holistic information of the entire scene. Compared with pixel-level or object-based tasks, it involves a richer semantic context, making it more challenging. With the rapid advancement of deep learning, convolutional neural networks (CNNs) have found widespread applications across various domains, and some work has introduced them into scene classification tasks. However, traditional convolution operations involve sliding small convolutional kernels across an image, primarily focusing on local details within a small receptive field. To achieve better modeling of the entire image, the smaller receptive field limits the ability of convolution operation to capture features over a broader range. To this end, we introduce large kernel CNNs into the scene classification task to expand the receptive field of the mode, which allows us to capture comprehensive nonlocal information while still acquiring rich local details. However, in addition to encoding spatial association, the effective information within the feature maps is also strongly channel related. Therefore, to fully model this channel dependency, a novel channel separation and mixing module has been designed to realize feature correlation in the channel dimension. The combination of them forms a large kernel separable mixed ConvNet, enabling the model to capture effective dependencies of feature maps in both spatial and channel dimensions, thus achieving enhanced feature expression. Extensive experiments conducted on three datasets have also validated the effectiveness of the proposed method.

引用

页码：4294 / 4303

页数：10

共 74 条

[1] BoVSG: bag of visual SubGraphs for remote sensing scene classification [J].

Amiri, Khitem ;

Farah, Mohamed ;

Leloglu, Ugur Murat .

INTERNATIONAL JOURNAL OF REMOTE SENSING, 2020, 41 (05) :1986-2003

[2]

[Anonymous], 2010, P 18 ACM SIGSPATIAL

[3] Binary patterns encoded convolutional neural networks for texture recognition and remote sensing scene classification [J].

Anwer, Rao Muhammad ;

Khan, Fahad Shahbaz ;

van de Weijer, Joost ;

Molinier, Matthieu ;

Laaksonen, Jorma .

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 138 :74-85

[4] Remote Sensing Image Retrieval With Global Morphological Texture Descriptors [J].

Aptoula, Erchan .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2014, 52 (05) :3023-3034

[5] Simple Yet Effective Fine-Tuning of Deep CNNs Using an Auxiliary Classification Loss for Remote Sensing Scene Classification [J].

Bazi, Yakoub ;

Al Rahhal, Mohamad M. ;

Alhichri, Haikel ;

Alajlan, Naif .

REMOTE SENSING, 2019, 11 (24)

[6] A Multiple-Instance Densely-Connected ConvNet for Aerial Scene Classification [J].

Bi, Qi ;

Qin, Kun ;

Li, Zhili ;

Zhang, Han ;

Xu, Kai ;

Xia, Gui-Song .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :4911-4926

[7] Fusing Local and Global Features for High-Resolution Scene Classification [J].

Bian, Xiaoyong ;

Chen, Chen ;

Tian, Long ;

Du, Qian .

IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2017, 10 (06) :2889-2901

[8] Feature Learning With Matrix Factorization Applied to Acoustic Scene Classification [J].

Bisot, Victor ;

Serizel, Romain ;

Essid, Slim ;

Richard, Gael .

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (06) :1216-1229

[9] Bag of spatio-visual words for context inference in scene classification [J].

Bolovinou, A. ;

Pratikakis, I. ;

Perantonis, S. .

PATTERN RECOGNITION, 2013, 46 (03) :1039-1053

[10] Deep Feature Fusion for VHR Remote Sensing Scene Classification [J].

Chaib, Souleyman ;

Liu, Huan ;

Gu, Yanfeng ;

Yao, Hongxun .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (08) :4775-4784

← 1 2 3 4 5 6 7 8 →