Neighbour feature attention-based pooling

被引:3
作者
Li, Xiaosong [1 ]
Wu, Yanxia [1 ]
Fu, Yan [1 ]
Tang, Chuheng [2 ]
Zhang, Lidan [3 ]
机构
[1] Harbin Engn Univ, Coll Comp Sci & Technol, 145 Nantong St, Harbin 150001, Heilongjiang, Peoples R China
[2] Shanghai Inst Satellite Engn, 3666 Yuanjiang Rd, Shanghai 201109, Peoples R China
[3] Intel Labs China, Beijing 100190, Peoples R China
关键词
Convolutional neural network; Pooling method; Neighbour feature attention;
D O I
10.1016/j.neucom.2022.05.094
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In modern convolutional neural networks (CNNs), the pooling layer is seen as one of the primary layers for building the CNN model, which effectively downscales the spatial size of feature maps to reduce memory consumption. Several types of pooling operations, such as average pooling, max pooling, and strided convolution, fail to capture the spatial dependence between the pooling region feature and its neighbour features. In this paper, we propose a simple but effective attention-based pooling method called Neighbour Feature Attention-Based Pooling (NFP), which integrates neighbour features of the pooling region to keep semantic continuity across multiple layers. NFP adopts attention weights encoding with neighbour features by depthwise convolution, which effectively directs local spatial pooling for learning discriminative features. Compared to other pooling methods, the proposed method generates more discriminative features directed by neighbour information of the pooling region. The experiments results show that it consistently improves the performance across various backbone architectures on image classification tasks.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:285 / 293
页数:9
相关论文
共 40 条
[1]  
Alex K., 2009, LEARNING MULTIPLE LA
[2]  
[Anonymous], 2013, P INT C LEARN REPR
[3]  
[Anonymous], 2020, PyTorch
[4]   UNet plus plus : A Nested U-Net Architecture for Medical Image Segmentation [J].
Zhou, Zongwei ;
Siddiquee, Md Mahfuzur Rahman ;
Tajbakhsh, Nima ;
Liang, Jianming .
DEEP LEARNING IN MEDICAL IMAGE ANALYSIS AND MULTIMODAL LEARNING FOR CLINICAL DECISION SUPPORT, DLMIA 2018, 2018, 11045 :3-11
[5]   Multi-scale stacking attention pooling for remote sensing scene classification [J].
Bi, Qi ;
Zhang, Han ;
Qin, Kun .
NEUROCOMPUTING, 2021, 436 :147-161
[6]   APDC-Net: Attention Pooling-Based Convolutional Network for Aerial Scene Classification [J].
Bi, Qi ;
Qin, Kun ;
Zhang, Han ;
Xie, Jiafen ;
Li, Zhili ;
Xu, Kai .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (09) :1603-1607
[7]   Learning the Best Pooling Strategy for Visual Semantic Embedding [J].
Chen, Jiacheng ;
Hu, Hexiang ;
Wu, Hao ;
Jiang, Yuning ;
Wang, Changhu .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :15784-15793
[8]  
Chrabaszcz Patryk, 2017, ARXIV170708819 CS
[9]   CUNet: A Compact Unsupervised Network For Image Classification [J].
Dong, Le ;
He, Ling ;
Mao, Mengdie ;
Kong, Gaipeng ;
Wu, Xi ;
Zhang, Qianni ;
Cao, Xiaochun ;
Izquierdo, Ebroul .
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (08) :2012-2021
[10]  
Gao ZT, 2023, INT J COMPUT VISION, V131, P363, DOI [10.1109/ICCV.2019.00345, 10.1007/s11263-022-01707-4]