CAM-YOLO: tomato detection and classification based on improved YOLOv5 using combining attention mechanism

Cited by: 23
Authors
Appe, Seetharam Nagesh [1, 2]
Arulselvi, G. [1 ]
Balaji, G. N. [3 ]
Affiliations
[1] Annamalai Univ, Dept Comp Sci & Engn, Chidambaram, Tamilnadu, India
[2] CVR Coll Engn, Dept Informat Technol, Hyderabad, India
[3] Vellore Inst Technol, Sch Comp Sci & Engn, Vellore, Tamilnadu, India
Keywords
Object detection; Non-max suppression; Distance intersection over union; CBAM; FRUIT
DOI
10.7717/peerj-cs.1463
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Background. Tomato quality is a key element in maintaining consistent marketing of tomato fruit. Because ripeness is the most important quality factor from the consumer's viewpoint, determining the stage of ripeness is a fundamental industrial concern in producing a high-quality product. Tomatoes are among the most important crops in the world, so automatic ripeness evaluation is a significant research topic: it may help ensure optimal production of high-quality fruit and increase profitability. This article explores and categorises the various maturity/ripeness phases in order to propose an automated multi-class classification approach for tomato ripeness testing and evaluation.
Methods. Object detection is a critical component in a wide variety of computer vision problems and applications, such as manufacturing, agriculture, medicine, and autonomous driving. Because tomato fruits appear against complex backgrounds, with texture disruption and partial occlusion, the classic deep learning object detection approach (YOLO) has a poor success rate in detecting them. To address these issues, this article proposes an improved YOLOv5 tomato detection algorithm. The proposed algorithm, CAM-YOLO, uses YOLOv5 for feature extraction and target identification, combined with a Convolutional Block Attention Module (CBAM). The CBAM is added to CAM-YOLO to focus the model's attention and improve accuracy. Finally, non-maximum suppression and distance intersection over union (DIoU) are applied to enhance the identification of overlapping objects in the image.
Results. Several images from the dataset were chosen for testing to assess the model's performance, and the detection performance of CAM-YOLO and the standard YOLOv5 model was compared under various conditions. The experimental results affirm that the CAM-YOLO algorithm is efficient in detecting overlapped and small tomatoes, with an average precision of 88.1%.
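The abstract names non-maximum suppression combined with DIoU as the step that helps with overlapping fruit. The following is a minimal sketch of that general technique, not the authors' implementation. It assumes boxes are given as (x1, y1, x2, y2) corner tuples and uses the standard definition DIoU = IoU - d^2 / c^2, where d is the distance between box centres and c is the diagonal of the smallest box enclosing both. The function names `diou` and `diou_nms` and the 0.5 threshold are illustrative assumptions, not values taken from the paper.

```python
def diou(a, b):
    """Distance-IoU of two boxes given as (x1, y1, x2, y2)."""
    # Intersection rectangle (zero area if the boxes do not overlap).
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    iou = inter / union if union > 0 else 0.0
    # Squared distance between the two box centres.
    d2 = ((a[0] + a[2]) - (b[0] + b[2])) ** 2 / 4.0 \
        + ((a[1] + a[3]) - (b[1] + b[3])) ** 2 / 4.0
    # Squared diagonal of the smallest box enclosing both.
    c2 = (max(a[2], b[2]) - min(a[0], b[0])) ** 2 \
        + (max(a[3], b[3]) - min(a[1], b[1])) ** 2
    return iou - d2 / c2 if c2 > 0 else iou


def diou_nms(boxes, scores, threshold=0.5):
    """Return indices of kept boxes, highest score first.

    The threshold value is an illustrative assumption.
    """
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    while order:
        best = order.pop(0)
        keep.append(best)
        # Drop candidates whose DIoU with the kept box reaches the threshold.
        order = [i for i in order if diou(boxes[best], boxes[i]) < threshold]
    return keep


boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
kept = diou_nms(boxes, scores)
```

Because the centre-distance term is subtracted from IoU, two boxes that overlap but have well-separated centres score lower than under plain IoU and are less likely to be suppressed, which is the property that makes this variant attractive for clustered objects.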
Pages: 18