Benchmarking edge computing devices for grape bunches and trunks detection using accelerated object detection single shot multibox deep learning models

Cited by: 7

Authors:

Magalhaes, Sandro Costa [1,2]

dos Santos, Filipe Neves [2]

Machado, Pedro [3]

Moreira, Antonio Paulo [1,2]

Dias, Jorge [4,5]

Affiliations:

[1] INESC TEC Inst Engn Tecnol & Ciencia, Campus FEUP,Rua Dr Roberto Frias S-N, P-4200465 Porto, Porto, Portugal

[2] Univ Porto, Fac Engn, Campus FEUP,Rua Dr Roberto Frias S-N, P-4200465 Porto, Porto, Portugal

[3] Nottingham Trent Univ, Sch Sci & Technol, Dept Comp Sci, Computat Intelligence & Applicat Grp CIA, Clifton Campus, Nottingham NG11 8NS, England

[4] Khalifa Univ Ctr Autonomous Robot Syst KUCARS, Khalifa Univ Sci Technol & Res KU, 127788, Abu Dhabi, U Arab Emirates

[5] Univ Coimbra, Dept Elect Engn & Comp, Rua Silvio Lima, P-3030290 Coimbra, Portugal

Source:

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2023 / Volume 117

Funding:

European Union Horizon 2020;

Keywords:

Embedded systems; Heterogeneous platforms; Object detection; SSD ResNet; RetinaNet ResNet;

DOI:

10.1016/j.engappai.2022.105604

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Purpose: Visual perception enables robots to perceive the environment. Visual data is processed using computer vision algorithms that are usually time-expensive and require powerful devices to process the data in real time, which is unfeasible for open-field robots with limited energy. This work benchmarks the real-time object detection performance of heterogeneous platforms across three architectures: embedded GPU (Graphical Processing Unit, such as NVIDIA Jetson Nano 2 GB and 4 GB, and NVIDIA Jetson TX2), TPU (Tensor Processing Unit, such as Coral Dev Board TPU), and DPU (Deep Learning Processor Unit, such as in the AMD-Xilinx ZCU104 Development Board and the AMD-Xilinx Kria KV260 Starter Kit). Methods: The authors used RetinaNet ResNet-50 fine-tuned on the natural VineSet dataset. The trained model was then converted and compiled for target-specific hardware formats to improve execution efficiency. Conclusions and Results: The platforms were assessed in terms of performance on the evaluation metrics and efficiency (inference time). Graphical Processing Units (GPUs) were the slowest devices, running at 3 FPS to 5 FPS, and Field Programmable Gate Arrays (FPGAs) were the fastest devices, running at 14 FPS to 25 FPS. The efficiency of the Tensor Processing Unit (TPU) is irrelevant and similar to that of the NVIDIA Jetson TX2. TPU and GPU are the most power-efficient, consuming about 5 W. The differences in the evaluation metrics across devices are irrelevant: all devices have an F1 of about 70 % and a mean Average Precision (mAP) of about 60 %.


Pages: 15
