A lightweight improved YOLOv5s model and its deployment for detecting pitaya fruits in daytime and nighttime light-supplement environments

被引：69

作者：

Li, Hongwei ^{[1
,4
]}

Gu, Zenan ^{[1
]}

He, Deqiang ^{[1
,4
]}

Wang, Xicheng ^{[5
]}

Huang, Junduan ^{[2
]}

Mo, Yongmei ^{[1
]}

Li, Peiwei ^{[1
]}

Huang, Zhihao ^{[1
]}

Wu, Fengyun ^{[3
]}

机构：

[1] Guangxi Univ, Sch Mech Engn, 100 East Daxue Rd, Nanning 530004, Peoples R China

[2] South China Univ Technol, Sch Automat Sci & Engn, 381 Wushan Rd, Guangzhou 510641, Peoples R China

[3] Guangzhou Coll Commerce, Sch Modern Informat Ind, Guangzhou 510000, Peoples R China

[4] Guangxi Univ, Sch Mech Engn, Guangxi Key Lab Mfg Syst & Adv Mfg Technol, Nanning 530004, Peoples R China

[5] South China Agr Univ, Coll Engn, 483 Wushan Rd, Guangzhou 510642, Guangdong, Peoples R China

来源：

COMPUTERS AND ELECTRONICS IN AGRICULTURE | 2024年 / 220卷

关键词：

Pitaya fruit detection; Daytime and nighttime light -supplement envi; ronments; Android deployment; YOLOv5; RGB;

D O I：

10.1016/j.compag.2024.108914

中图分类号：

S [农业科学];

学科分类号：

09 ;

摘要：

Precise detection and low-cost deployment are the technological basis of intelligent fruit picking. This study proposes a lightweight improved YOLOv5s model to detect pitaya fruits in daytime and nighttime lightsupplement environments, and make it successfully deploy in an Android device. This model first uses the module of shufflenetv2 to reconstruct the YOLOv5s backbone network. Then, the study proposes a ConcentratedComprehensive Convolution Receptive Field Enhancement (C3RFE) module to improve the detection precision of pitaya fruits. Furthermore, a Bidirectional Feature Pyramid Network (BiFPN) feature fusion method is used to enhance the multi-scale feature fusion. Moreover, three optimized Squeeze-and-Excitation (SE) attention modules are added to make full use of the image feature information. Finally, a dynamic label allocation strategy simple Optimal Transport Assignment (simOTA) is used to optimize the YOLOv5s model original label allocation strategy. The experimental results show that the improved model achieves an average precision rate of 97.80 %, with frames per second (FPS) of 139 FPS in a GPU run environment. The model size is only 2.5 MB. Compared to the state-of-the-art SSD, Faster RCNN, YOLOv4, YOLOv4 tiny, YOLOv5s, YOLOv5Lite-s, YOLOXs, YOLOv7, YOLOv7-tiny, YOLOv8n and YOLOv8s, the improved YOLOv5s achieve preferred comprehensive performance in average precision rate, FPS and model size. When deploying this model on the Realme GT Android mobile phone by developing an application based on an NCNN framework, such an application accomplishes real-time pitaya fruit detection with an FPS exceeding 30 FPS. This study can provide technological support for a precise and effective pitaya fruit intelligent picking.

引用

页数：21

共 45 条

[1] Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks [J].

Chen, Jierun ;

Kao, Shiu-Hong ;

He, Hao ;

Zhuo, Weipeng ;

Wen, Song ;

Lee, Chul-Ho ;

Chan, S. -H. Gary .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :12021-12031

[2] Detecting ripe fruits under natural occlusion and illumination conditions [J].

Chen, Jiqing ;

Wu, Jiahua ;

Wang, Zhikui ;

Qiang, Hu ;

Cai, Ganwei ;

Tan, Chengzhi ;

Zhao, Chaoyang .

COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 190

[3] Exploratory factor analysis revealing complex structure [J].

Ertel, Suitbert .

PERSONALITY AND INDIVIDUAL DIFFERENCES, 2011, 50 (02) :196-200

[4] YOLO-Banana: A Lightweight Neural Network for Rapid Detection of Banana Bunches and Stalks in the Natural Environment [J].

Fu, Lanhui ;

Yang, Zhou ;

Wu, Fengyun ;

Zou, Xiangjun ;

Lin, Jiaquan ;

Cao, Yongjun ;

Duan, Jieli .

AGRONOMY-BASEL, 2022, 12 (02)

[5] Faster R-CNN-based apple detection in dense-foliage fruiting-wall trees using RGB and depth features for robotic harvesting [J].

Fu, Longsheng ;

Majeed, Yaqoob ;

Zhang, Xin ;

Karkee, Manoj ;

Zhang, Qin .

BIOSYSTEMS ENGINEERING, 2020, 197 :245-256

[6]

Ge Z, 2021, Arxiv, DOI arXiv:2107.08430

[7] OTA: Optimal Transport Assignment for Object Detection [J].

Ge, Zheng ;

Liu, Songtao ;

Liu, Zeming ;

Yoshie, Osamu ;

Sun, Jian .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :303-312

[8]

Gevorgyan Z, 2022, Arxiv, DOI arXiv:2205.12740

[9] Rich feature hierarchies for accurate object detection and semantic segmentation [J].

Girshick, Ross ;

Donahue, Jeff ;

Darrell, Trevor ;

Malik, Jitendra .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587

[10] GhostNet: More Features from Cheap Operations [J].

Han, Kai ;

Wang, Yunhe ;

Tian, Qi ;

Guo, Jianyuan ;

Xu, Chunjing ;

Xu, Chang .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :1577-1586

← 1 2 3 4 5 →