Optimizing Loss Functions for You Only Look Once Models: Improving Object Detection in Agricultural Datasets

被引:0
作者
Matsui, Atsuki [1 ]
Ishibashi, Ryuto [1 ]
Meng, Lin [2 ]
机构
[1] Ritsumeikan Univ, Grad Sch Sci & Engn, 1-1-1 Nojihigashi, Shiga 5258577, Japan
[2] Ritsumeikan Univ, Coll Sci & Engn, 1-1-1 Nojihigashi, Shiga 5258577, Japan
关键词
object detection; YOLO; loss;
D O I
10.3390/computers14020044
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Japan faces a significant labor shortage due to an aging population, particularly in the agricultural sector. The rising average age of farmers and the declining participation of younger individuals threaten the sustainability of farming practices. These trends reduce the availability of agricultural labor and pose a risk to lowering Japan's food self-sufficiency rate. The reliance on food imports raises concerns regarding price fluctuations and sanitation standards. Moreover, the challenging working conditions in agriculture and a lack of technological innovation have hindered productivity and increased the burden on the existing workforce. To address these challenges, "smart agriculture" presents a promising solution. By leveraging advanced technologies such as sensors, drones, the Internet of Things (IoT), and automation, smart agriculture aims to optimize farm operations. Real-time data collection and AI-driven analysis play a crucial role in monitoring crop growth, assessing soil conditions, and improving overall efficiency. This study proposes enhancements to the YOLO (You Only Look Once) object detection model to develop an automated tomato harvesting system. This system uses a camera to detect tomatoes and assess their ripeness for harvest. Our objective is to streamline the harvesting process through AI technology. Our improved YOLO model integrates two novel loss functions to enhance detection accuracy. The first, "VSR", refines the model's ability to classify tomatoes and determine their harvest readiness. The second, "SBCE", enhances the detection of small tomatoes by training the model to recognize a range of object sizes within the dataset. These improvements have significantly increased the system's detection performance. Our experimental results demonstrate that the mean Average Precision (mAP) of YOLOv7-tiny improved from 61.81% to 70.21%. Additionally, the F1 score increased from 0.61 to 0.71 and the mean Intersection over Union (mIoU) rose from 65.03% to 66.44% on the tomato dataset. These findings underscore the potential of our proposed system to enhance efficiency in agricultural practices.
引用
收藏
页数:17
相关论文
共 27 条
[1]  
Carion N, 2020, Img Proc Comp Vis Re, V12346, P213, DOI 10.1007/978-3-030-58452-8_13
[2]   ArcFace: Additive Angular Margin Loss for Deep Face Recognition [J].
Deng, Jiankang ;
Guo, Jia ;
Xue, Niannan ;
Zafeiriou, Stefanos .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4685-4694
[3]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[4]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[5]   Deep Metric Learning Using Triplet Network [J].
Hoffer, Elad ;
Ailon, Nir .
SIMILARITY-BASED PATTERN RECOGNITION, SIMBAD 2015, 2015, 9370 :84-92
[6]  
Ishibashi R., 2023, P 2023 INT C ADV MEC, P1
[7]   Deteriorated Characters Restoration for Early Japanese Books Using Enhanced CycleGAN [J].
Kaneko, Hayata ;
Ishibashi, Ryuto ;
Meng, Lin .
HERITAGE, 2023, 6 (05) :4345-4361
[8]   Computer vision based system for apple surface defect detection [J].
Li, QZ ;
Wang, MH ;
Gu, WK .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2002, 36 (2-3) :215-223
[9]  
Lin TY, 2018, Arxiv, DOI arXiv:1708.02002
[10]   SSD: Single Shot MultiBox Detector [J].
Liu, Wei ;
Anguelov, Dragomir ;
Erhan, Dumitru ;
Szegedy, Christian ;
Reed, Scott ;
Fu, Cheng-Yang ;
Berg, Alexander C. .
COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :21-37