MCA-YOLOv7: An Improved UAV Target Detection Algorithm Based on YOLOv7

被引:10
作者
Qin, Zhiyong [1 ]
Chen, Dike [1 ]
Wang, Hongyuan [1 ]
机构
[1] Changzhou Univ, Sch Comp Sci & Artificial Intelligence, Changzhou 213000, Peoples R China
基金
中国国家自然科学基金;
关键词
UAV; object detection; attention mechanism; context aggregation; loss function;
D O I
10.1109/ACCESS.2024.3378748
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Aiming at the problems of tiny targets, large target scale changes, and background information interference in target detection of UAV(Unmanned Aerial Vehicle) aerial images, a revised UAV target detection algorithm MCA-YOLOv7 based on YOLOv7 is proposed, and the algorithm advances from the following points: optimizing the FPN(Feature Pyramid Networks) structure to increase the small-target detection layer, and boosting the network's detection ability for small targets. To enhance the multi-scale feature extraction capability, the Efficient Multi-Scale Attention(EMA) is added. In order to reduce the complexity of the model and reduce the confusion of background information, the context aggregation block (CABlock) was introduced and improved, and an effective context aggregation block (ECABlock) was proposed. The loss function CIoU is enhanced and a new loss function FCIoU is proposed, which accelerates the convergence speed of the model, and obtains more accurate regression results. The experimental results demonstrate that the MCA-YOLOv7 model reduces the number of model parameters by 4.7 M and increases the average accuracy (mAP@0.5) by 2.9% when compared to YOLOv7 on the VisDrone2019 dataset. The new algorithm is more capable of handling situations involving UAV aerial photography.
引用
收藏
页码:42642 / 42650
页数:9
相关论文
共 34 条
[1]   Cascade R-CNN: Delving into High Quality Object Detection [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6154-6162
[2]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[3]   Dual Attention Network for Unsupervised Domain Adaptive Person Re-Identification [J].
Chen, Haiqin ;
Wang, Hongyuan ;
Ding, Zongyuan ;
Li, Penghui .
IEEE ACCESS, 2023, 11 :88184-88192
[4]   VisDrone-DET2019: The Vision Meets Drone Object Detection in Image Challenge Results [J].
Du, Dawei ;
Zhu, Pengfei ;
Wen, Longyin ;
Bian, Xiao ;
Ling, Haibin ;
Hu, Qinghua ;
Peng, Tao ;
Zheng, Jiayu ;
Wang, Xinyao ;
Zhang, Yue ;
Bo, Liefeng ;
Shi, Hailin ;
Zhu, Rui ;
Kumar, Aashish ;
Li, Aijin ;
Zinollayev, Almaz ;
Askergaliyev, Anuar ;
Schumann, Arne ;
Mao, Binjie ;
Lee, Byeongwon ;
Liu, Chang ;
Chen, Changrui ;
Pan, Chunhong ;
Huo, Chunlei ;
Yu, Da ;
Cong, Dechun ;
Zeng, Dening ;
Pailla, Dheeraj Reddy ;
Li, Di ;
Wang, Dong ;
Cho, Donghyeon ;
Zhang, Dongyu ;
Bai, Furui ;
Jose, George ;
Gao, Guangyu ;
Liu, Guizhong ;
Xiong, Haitao ;
Qi, Hao ;
Wang, Haoran ;
Qiu, Heqian ;
Li, Hongliang ;
Lu, Huchuan ;
Kim, Ildoo ;
Kim, Jaekyum ;
Shen, Jane ;
Lee, Jihoon ;
Ge, Jing ;
Xu, Jingjing ;
Zhou, Jingkai ;
Meier, Jonas .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :213-226
[5]  
Gevorgyan Z, 2022, Arxiv, DOI arXiv:2205.12740
[6]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[7]   ReDet: A Rotation-equivariant Detector for Aerial Object Detection [J].
Han, Jiaming ;
Ding, Jian ;
Xue, Nan ;
Xia, Gui-Song .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :2785-2794
[8]   Coordinate Attention for Efficient Mobile Network Design [J].
Hou, Qibin ;
Zhou, Daquan ;
Feng, Jiashi .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13708-13717
[9]  
Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]
[10]  
Jiashuai Dai, 2021, 2021 IEEE 2nd International Conference on Information Technology, Big Data and Artificial Intelligence (ICIBA), P736, DOI 10.1109/ICIBA52610.2021.9688305