YOLOv4 Object Detection Algorithm with Efficient Channel Attention Mechanism

被引：23

作者：

Gao, Cui ^{[1
,2
]}

Cai, Qiang ^{[1
,2
]}

Ming, Shaofeng ^{[1
,2
]}

机构：

[1] Beijing Technol & Business Univ, Natl Engn Lab Agriprod Qual Traceabil, Beijing, Peoples R China

[2] BTBU, Beijing Key Lab Big Data Technol, Beijing, Peoples R China

来源：

2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020) | 2020年

关键词：

object detection; attention mechanism; deep learning;

D O I：

10.1109/ICMCCE51767.2020.00387

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Channel attention mechanism has been widely used in object detection algorithms because of its strong feature representation ability. The real-time object detection algorithm YOLOv4 has fast detection speed and high accuracy, but it still has some shortcomings, such as inaccurate bounding box positioning and poor robustness. Therefore, we introduced channel attention mechanism into the YOLOv4 algorithm to enhance the feature representation ability of images, and proposed a object detection algorithm with channel attention mechanism. This module firstly carries out global average pooling operation on the features extracted by YOLOv4, and then carries out local cross-channel interactive operation on the feature channels through one-dimensional convolution to enhance the correlation between the features of channels, so as to improve the positioning accuracy of YOLOv4. Our method has achieved good results in the PASCAL VOC dataset. Compared with the original YOLOv4 algorithm, the mAP of this algorithm in the PASCAL VOC test set is improved by 0.62%.

引用

页码：1764 / 1770

页数：7

共 17 条

[1]

Bochkovskiy A., 2020, COMPUTER VISION PATT

[2] Deformable Convolutional Networks [J].

Dai, Jifeng ;

Qi, Haozhi ;

Xiong, Yuwen ;

Li, Yi ;

Zhang, Guodong ;

Hu, Han ;

Wei, Yichen .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773

[3] Fast R-CNN [J].

Girshick, Ross .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448

[4] Rich feature hierarchies for accurate object detection and semantic segmentation [J].

Girshick, Ross ;

Donahue, Jeff ;

Darrell, Trevor ;

Malik, Jitendra .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587

[5]

He KM, 2014, LECT NOTES COMPUT SC, V8691, P346, DOI [arXiv:1406.4729, 10.1007/978-3-319-10578-9_23]

[6]

Hillhouse Sun Jian, 2020, SOFTWARE, P46

[7]

Hoiem D., 2009, PASCAL CHALL WORKSH

[8] Feature Pyramid Networks for Object Detection [J].

Lin, Tsung-Yi ;

Dollar, Piotr ;

Girshick, Ross ;

He, Kaiming ;

Hariharan, Bharath ;

Belongie, Serge .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :936-944

[9] Path Aggregation Network for Instance Segmentation [J].

Liu, Shu ;

Qi, Lu ;

Qin, Haifang ;

Shi, Jianping ;

Jia, Jiaya .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8759-8768

[10]

Redmon J, 2018, Arxiv, DOI [arXiv:1804.02767, DOI 10.48550/ARXIV.1804.02767]

← 1 2 →