Hand target detection based on improved YOLOv5

被引：1

作者：

Xu Z. ^{[1
]}

Meng J. ^{[1
]}

Fang J. ^{[1
]}

机构：

[1] Department of Mathematics and Computer Science, Tongling University, Anhui, Tongling

来源：

International Journal of Wireless and Mobile Computing | 2023年 / 25卷 / 04期

关键词：

convolutional neural network; hand detection; target detection; YOLOv5;

D O I：

10.1504/IJWMC.2023.135394

中图分类号：

学科分类号：

摘要：

With the growing maturity of deep learning-based target detection algorithms, their deployment in intelligent service robots for target detection has become popular nowadays, in order to improve the precision of real-time hand detection and recognition by intelligent service robots, enabling them to detect hands accurately in a variety of environments. This paper proposes a hand detection method based on improved YOLOv5 deep convolutional neural network. YOLOv5s is selected as the base target detection model, the SE attention module is added to the network neck detection layer to guide the model to pay more attention to the channel features of small target to improve the detection performance, and the detection layer is added to enhance the feature learning ability of the network for target regions. The loss function of the detection model is optimised according to the hand image features to improve the confidence of the prediction frame. The experimental results show that the proposed hand detection method based on the improved YOLOv5 deep convolutional neural network can achieve a precision of 99.02%, which is 6.54% better than the original YOLOv5. © 2023 Inderscience Enterprises Ltd.

引用

页码：353 / 361

页数：8

共 34 条

[21]

Poppinga B., Laue T., JET-Net: Real-time object detection for mobile robots, Robot World Cup, pp. 227-240, (2019)

[22]

Qing Y., Liu W., Feng L., Gao W., Improved Yolo network for free-angle remote sensing target detection, Remote Sensing, 13, 11, (2021)

[23]

Rastgoo R., Kiani K., Escalera S., Hand pose aware multimodal isolated sign language recognition, Multimedia Tools and Applications, 80, 1, pp. 127-163, (2021)

[24]

Redmon J., Farhadi A., YOLO9000: better, faster, stronger, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7263-7271, (2017)

[25]

Redmon J., Farhadi A., Yolov3: an incremental improvement, (2018)

[26]

Redmon J., Divvala S., Girshick R., Farhadi A., You only look once: unified, real-time object detection, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779-788, (2016)

[27]

Ren S., He K., Girshick R., Sun J., Faster R-CNN: towards real-time object detection with region proposal networks, Advances in Neural Information Processing Systems, 28, pp. 1-9, (2015)

[28]

Rezatofighi H., Tsoi N., Gwak J., Sadeghian A., Reid I., Savarese S., Generalized intersection over union: a metric and a loss for bounding box regression, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 658-666, (2019)

[29]

Song H., Zhang X., Song J., Zhao J., Detection and tracking of safety helmet based on DeepSort and YOLOv5, Multimedia Tools and Applications, pp. 1-14, (2022)

[30]

Tang L., Xie T., Yang Y., Wang H., Classroom behavior detection based on improved YOLOv5 algorithm combining multi-scale feature fusion and attention mechanism, Applied Sciences, 12, 13, (2022)

← 1 2 3 4 →