Pedestrian Target Detection Based on Attention Mechanism in Cloud Computing

被引:0
作者
Zhao, Lihua [1 ]
Zeng, Fanjun [1 ]
机构
[1] Guangzhou Vocat Coll Technol & Business, Fac Informat Engn, Guangzhou, Peoples R China
来源
PROCEEDINGS OF INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, MACHINE LEARNING AND PATTERN RECOGNITION, IPMLP 2024 | 2024年
关键词
UAV; CBAM; SIOU; YOLOv8;
D O I
10.1145/3700906.3700957
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As one of the most fundamental facial recognition technologies, pedestrian target detection faces particular challenges in UAV (unmanned aerial vehicle) aerial images where small targets are often shadowed or obscured. This paper proposes a Convolutional Block Attention Module (CBAM) UAV pedestrian detection resource allocation strategy for edge devices to tackle these challenges. In the study, the CBAM module was first added to enhance feature expression and improve the model's robustness. Second, the Scale Invariant Objective (SIOU) loss function was employed to raise recognition precision, resulting in a new generation of YOLOv8 pedestrian target detection based on the attention mechanism. Finally, extensive experiments were performed on the YOLOv8 baseline models, with results on our dataset demonstrating that the proposed strategy achieved state-of-the-art performance and efficiency across various model scales.
引用
收藏
页码:313 / 317
页数:5
相关论文
共 10 条
[1]  
Dolev D., 2012, P 3 INN THEOR COMP S, P68
[2]  
Ghodsi An, 2011, Computer Communication Review, V41, P507, DOI 10.1145/2018584.2018586
[3]   Multi-Resource Fair Queueing for Packet Processing [J].
Ghodsi, Ali ;
Sekar, Vyas ;
Zaharia, Matei ;
Stoica, Ion .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2012, 42 (04) :1-12
[4]   No Agent Left Behind: Dynamic Fair Division of Multiple Resources [J].
Kash, Ian ;
Procaccia, Ariel D. ;
Shah, Nisarg .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2014, 51 :579-603
[5]   A Systematic Review on Imbalanced Data Challenges in Machine Learning: Applications and Solutions [J].
Kaur, Harsurinder ;
Pannu, Husanbir Singh ;
Malhi, Avleen Kaur .
ACM COMPUTING SURVEYS, 2019, 52 (04)
[6]  
Liu Chen, 2023, Computer Science and Application, V13, P1092
[7]   The Burr type XII distribution as a failure model under various loss functions [J].
Moore, D ;
Papadopoulos, AS .
MICROELECTRONICS RELIABILITY, 2000, 40 (12) :2117-2122
[8]   Cake Cutting: Not Just Child's Play [J].
Procaccia, Ariel D. .
COMMUNICATIONS OF THE ACM, 2013, 56 (07) :78-87
[9]   Face Mask-Wearing Detection Model Based on Loss Function and Attention Mechanism [J].
Wang, Zhong ;
Sun, Wu ;
Zhu, Qiang ;
Shi, Peibei .
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
[10]   CBAM: Convolutional Block Attention Module [J].
Woo, Sanghyun ;
Park, Jongchan ;
Lee, Joon-Young ;
Kweon, In So .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :3-19