An Anchor-Free Target Detection Algorithm Combining Attention and Dilation Convolution

Cited by: 0
Authors
Xiong, Lei [1 ,2 ,3 ]
Wang, Fengsui [1 ,2 ,3 ]
Qian, Yaping [1 ,2 ,3 ]
Xu, Yue [1 ,2 ,3 ]
Affiliations
[1] Anhui Polytech Univ, Sch Elect Engn, Wuhu 241000, Peoples R China
[2] Anhui Key Lab Detect Technol & Energy Saving Devi, Wuhu 241000, Peoples R China
[3] Minist Educ, Key Lab Adv Percept & Intelligent Control High En, Wuhu 241000, Peoples R China
Source
2022 INTERNATIONAL CONFERENCE ON FRONTIERS OF ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING, FAIML | 2022
Keywords
target detection; Anchor-free; CenterNet; dilation convolution;
DOI
10.1109/FAIML57028.2022.00016
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
To address the limited target detection capability of CenterNet, an improved target detection model combining attention and dilated convolution is proposed. First, to improve the network's ability to capture the semantic and location features of targets, an improved non-local attention module (CANL) is designed to capture the long-range dependencies of targets in the image along the channel domain and the spatial domain, respectively. Second, a multi-scale feature extraction network based on dilated convolution (MSNet) is designed to improve the network's representation of targets at different scales: a residual structure fuses receptive-field features of multiple scales in parallel and retains the feature information obtained for targets at each scale. Finally, the proposed algorithm is evaluated on the PASCAL VOC dataset. Its detection accuracy is 2.65% higher than that of the baseline CenterNet, which effectively improves the performance of the anchor-free target detection algorithm.
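The idea of fusing parallel dilated-convolution branches through a residual connection can be illustrated with a minimal sketch in plain Python. This is not the authors' MSNet: the 1D signal, the kernel, the dilation rates (1, 2, 4), the summation used for fusion, and all function names are assumptions chosen only to show how dilation widens the receptive field while the residual path keeps the original input.

```python
def effective_kernel_size(k, dilation):
    """Span covered by a k-tap kernel once gaps of (dilation - 1) are inserted."""
    return k + (k - 1) * (dilation - 1)


def dilated_conv1d(x, kernel, dilation):
    """Zero-padded, same-length 1D dilated convolution (cross-correlation form).

    Assumes an odd kernel length so the padding is symmetric.
    """
    k = len(kernel)
    pad = dilation * (k - 1) // 2
    padded = [0.0] * pad + list(x) + [0.0] * pad
    return [
        sum(kernel[j] * padded[i + j * dilation] for j in range(k))
        for i in range(len(x))
    ]


def multi_scale_block(x, kernel, dilations=(1, 2, 4)):
    """Run parallel dilated branches, sum them, and add the input back (residual)."""
    branches = [dilated_conv1d(x, kernel, d) for d in dilations]
    fused = [sum(values) for values in zip(*branches)]
    return [xi + fi for xi, fi in zip(x, fused)]


if __name__ == "__main__":
    # A unit impulse makes each branch's receptive field visible in the output.
    impulse = [0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0]
    kernel = [1.0, 1.0, 1.0]
    for d in (1, 2, 4):
        print(d, effective_kernel_size(len(kernel), d), dilated_conv1d(impulse, kernel, d))
    print(multi_scale_block(impulse, kernel))
```

With a 3-tap kernel, the three branches cover spans of 3, 5, and 9 samples, so the fused output responds to the impulse at several distances at once, and the residual term keeps the original signal in the result. In a real detector the same pattern would operate on 2D feature maps with learned weights.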
Pages: 34-38
Number of pages: 5