Benchmarking edge computing devices for grape bunches and trunks detection using accelerated object detection single shot multibox deep learning models

被引:7
|
作者
Magalhaes, Sandro Costa [1 ,2 ]
dos Santos, Filipe Neves [2 ]
Machado, Pedro [3 ]
Moreira, Antonio Paulo [1 ,2 ]
Dias, Jorge [4 ,5 ]
机构
[1] INESC TEC Inst Engn Tecnol & Ciencia, Campus FEUP,Rua Dr Roberto Frias S-N, P-4200465 Porto, Porto, Portugal
[2] Univ Porto, Fac Engn, Campus FEUP,Rua Dr Roberto Frias S-N, P-4200465 Porto, Porto, Portugal
[3] Nottingham Trent Univ, Sch Sci & Technol, Dept Comp Sci, Computat Intelligence & Applicat Grp CIA, Clifton Campus, Nottingham NG11 8NS, England
[4] Khalifa Univ Ctr Autonomous Robot Syst KUCARS, Khalifa Univ Sci Technol & Res KU, 127788, Abu Dhabi, U Arab Emirates
[5] Univ Coimbra, Dept Elect Engn & Comp, Rua Silvio Lima, P-3030290 Coimbra, Portugal
基金
欧盟地平线“2020”;
关键词
Embedded systems; Heterogeneous platforms; Object detection; SSD resNet; RetinaNet resNet;
D O I
10.1016/j.engappai.2022.105604
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Purpose: Visual perception enables robots to perceive the environment. Visual data is processed using computer vision algorithms that are usually time-expensive and require powerful devices to process the visual data in real-time, which is unfeasible for open-field robots with limited energy. This work benchmarks the performance of different heterogeneous platforms for object detection in real-time. This research benchmarks three architectures: embedded GPU-Graphical Processing Units (such as NVIDIA Jetson Nano 2 GB and 4 GB, and NVIDIA Jetson TX2), TPU-Tensor Processing Unit (such as Coral Dev Board TPU), and DPU-Deep Learning Processor Unit (such as in AMD-Xilinx ZCU104 Development Board, and AMD-Xilinx Kria KV260 Starter Kit). Methods: The authors used the RetinaNet ResNet-50 fine-tuned using the natural VineSet dataset. After the trained model was converted and compiled for target-specific hardware formats to improve the execution efficiency.Conclusions and Results: The platforms were assessed in terms of performance of the evaluation metrics and efficiency (time of inference). Graphical Processing Units (GPUs) were the slowest devices, running at 3 FPS to 5 FPS, and Field Programmable Gate Arrays (FPGAs) were the fastest devices, running at 14 FPS to 25 FPS. The efficiency of the Tensor Processing Unit (TPU) is irrelevant and similar to NVIDIA Jetson TX2. TPU and GPU are the most power-efficient, consuming about 5 W. The performance differences, in the evaluation metrics, across devices are irrelevant and have an F1 of about 70 % and mean Average Precision (mAP) of about 60 %.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Benchmarking Deep Learning Models for Object Detection on Edge Computing Devices
    Alqahtani, Daghash K.
    Cheema, Muhammad Aamir
    Toosi, Adel N.
    SERVICE-ORIENTED COMPUTING, ICSOC 2024, PT I, 2025, 15404 : 142 - 150
  • [2] Benchmarking Object Detection Deep Learning Models in Embedded Devices
    Cantero, David
    Esnaola-Gonzalez, Iker
    Miguel-Alonso, Jose
    Jauregi, Ekaitz
    SENSORS, 2022, 22 (11)
  • [3] VEHICLES DETECTION ON EXPRESSWAY VIA DEEP LEARNING: SINGLE SHOT MULTIBOX OBJECT DETECTOR
    Chen, Kuang-Hsuan
    Shou, Tawei David
    Li, John Kun-Han
    Tsai, Chun-Ming
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 2, 2018, : 467 - 473
  • [4] Object detection on remote sensing images using deep learning: an improved single shot multibox detector method
    Zhao, Kun
    Ren, Xiaoxi
    Kong, Zhenzhen
    Liu, Min
    JOURNAL OF ELECTRONIC IMAGING, 2019, 28 (03)
  • [5] Evaluating the Single-Shot MultiBox Detector and YOLO Deep Learning Models for the Detection of Tomatoes in a Greenhouse
    Magalhaes, Sandro Augusto
    Castro, Luis
    Moreira, Germano
    dos Santos, Filipe Neves
    Cunha, Mario
    Dias, Jorge
    Moreira, Antonia Paulo
    SENSORS, 2021, 21 (10)
  • [6] Quantitative comparison and performance evaluation of deep learning-based object detection models on edge computing devices
    Lema, Dario G.
    Usamentiaga, Ruben
    Garcia, Daniel F.
    INTEGRATION-THE VLSI JOURNAL, 2024, 95
  • [7] Real Time Multi Object Detection for Blind Using Single Shot Multibox Detector
    Adwitiya Arora
    Atul Grover
    Raksha Chugh
    S. Sofana Reka
    Wireless Personal Communications, 2019, 107 : 651 - 661
  • [8] Real Time Multi Object Detection for Blind Using Single Shot Multibox Detector
    Arora, Adwitiya
    Grover, Atul
    Chugh, Raksha
    Reka, S. Sofana
    WIRELESS PERSONAL COMMUNICATIONS, 2019, 107 (01) : 651 - 661
  • [9] A Method for Optimizing Deep Learning Object Detection in Edge Computing
    Kim, Ryangsoo
    Kim, Geonyong
    Kim, Heedo
    Yoon, Giha
    Yoo, Hark
    11TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE: DATA, NETWORK, AND AI IN THE AGE OF UNTACT (ICTC 2020), 2020, : 1164 - 1167
  • [10] A novel deep learning-based single shot multibox detector model for object detection in optical remote sensing images
    Wang, Liguo
    Shoulin, Yin
    Alyami, Hashem
    Laghari, Asif Ali
    Rashid, Mamoon
    Almotiri, Jasem
    Alyamani, Hasan J.
    Alturise, Fahad
    GEOSCIENCE DATA JOURNAL, 2024, 11 (03): : 237 - 251