Selected and refined local attention module for object detection

被引:3
作者
Luo, Xiaofan [1 ]
Hu, Haifeng [1 ]
机构
[1] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou 510006, Guangdong, Peoples R China
基金
美国国家科学基金会;
关键词
object detection; neural nets; feature extraction; image fusion; feature expression; object detection results; SRLAM; nonlocal neural networks; LAM; detection accuracy; original feature map; feature fused module; selected and refined local attention module;
D O I
10.1049/el.2020.0182
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Lacking enough feature expression on the shallow part of the network always hinders the object detection results by missing the position of small instances. To address this, the authors propose a selected and refined local attention module (SRLAM) for object detection. SRLAM tries to improve the feature expression of local areas by establishing a relationship graph between different channels. Inspired by the classical non-local neural networks for video classification, they present the local attention module (LAM) to more effectively use remote information. The LAM can suppress the influence of irrelevant areas for improving detection accuracy. Moreover, for further information utilisation, the weight map generated by LAM is integrated together with the original feature map and the information from the higher layer by a feature fused module. Experiments on the PASCAL VOC2007 show that the authors' model has good detection performance, the proposed model can achieve 81.6 mAP when the size of the input image is 300 x 300, that outperforms many other mainstream detectors.
引用
收藏
页码:712 / +
页数:2
相关论文
共 13 条
[1]   Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks [J].
Bell, Sean ;
Zitnick, C. Lawrence ;
Bala, Kavita ;
Girshick, Ross .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2874-2883
[2]  
Everingham M., 2007, International journal of computer vision, DOI DOI 10.1007/s11263-009-0275-4
[3]   Object detection via a multi-region & semantic segmentation-aware CNN model [J].
Gidaris, Spyros ;
Komodakis, Nikos .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1134-1142
[4]   RON: Reverse Connection with Objectness Prior Networks for Object Detection [J].
Kong, Tao ;
Sun, Fuchun ;
Yao, Anbang ;
Liu, Huaping ;
Lu, Ming ;
Chen, Yurong .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5244-5252
[5]   RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation [J].
Lin, Guosheng ;
Milan, Anton ;
Shen, Chunhua ;
Reid, Ian .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5168-5177
[6]   Feature Pyramid Networks for Object Detection [J].
Lin, Tsung-Yi ;
Dollar, Piotr ;
Girshick, Ross ;
He, Kaiming ;
Hariharan, Bharath ;
Belongie, Serge .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :936-944
[7]   Receptive Field Block Net for Accurate and Fast Object Detection [J].
Liu, Songtao ;
Huang, Di ;
Wang, Yunhong .
COMPUTER VISION - ECCV 2018, PT XI, 2018, 11215 :404-419
[8]   SSD: Single Shot MultiBox Detector [J].
Liu, Wei ;
Anguelov, Dragomir ;
Erhan, Dumitru ;
Szegedy, Christian ;
Reed, Scott ;
Fu, Cheng-Yang ;
Berg, Alexander C. .
COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :21-37
[9]   You Only Look Once: Unified, Real-Time Object Detection [J].
Redmon, Joseph ;
Divvala, Santosh ;
Girshick, Ross ;
Farhadi, Ali .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :779-788
[10]   Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks [J].
Ren, Shaoqing ;
He, Kaiming ;
Girshick, Ross ;
Sun, Jian .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) :1137-1149