Improvement of Visual Perception in Humanoid Robots Using Heterogeneous Architectures for Autonomous Applications

被引：0

作者：

Guajo, Joaquin ^{[1
]}

Alzate Anzola, Cristian ^{[1
]}

Betancur, Daniel ^{[2
]}

Castano-Londono, Luis ^{[1
]}

Marquez-Viloria, David ^{[1
]}

机构：

[1] Inst Tecnol Metropolitano ITM, Dept Elect & Telecommun Engn, Medellin, Colombia

[2] Inst Univ Envigado, Syst & Comp Sci Res Grp, Medellin, Colombia

来源：

APPLIED COMPUTER SCIENCES IN ENGINEERING, WEA 2021 | 2021年 / 1431卷

关键词：

CNN; Field programmable gate array (FPGA); System-on-a-Chip (SoC); High-level synthesis (HLS); Humanoid robot;

D O I：

10.1007/978-3-030-86702-7_38

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Humanoid robots find application in a variety of tasks such as emotional recognition for human-robot interaction (HRI). Despite their capabilities, these robots have a sequential computing system that limits the execution of high computational cost algorithms such as Convolutional Neural Networks (CNNs), which have shown good performance in recognition tasks. This limitation reduces their performance in HRI applications. As an alternative to sequential computing units are Field-programmable gate arrays (FPGAs) and Graphics Processing Units (GPUs), which have a high degree of parallelism, high performance, and low power consumption. In this paper, we propose a visual perception enhancement system for humanoid robots using FPGA or GPU based embedded systems running a CNN, while maintaining autonomy through an external computational system added to the robot structure. Our work has as a case study the humanoid robot NAO, however, the work can be replicated on other robots such as Pepper and Robotis OP3. The development boards used were the Xilinx Ultra96 FPGA, Intel Cyclone V SoC FPGA and Nvidia Jetson TX2 GPU. Nevertheless, our design allows the integration of other heterogeneous architectures with high parallelism and low power consumption. The Tinier-Yolo, Alexnet and Inception-V1 CNNs are executed and real-time results were obtained for the FPGA and GPU cards, while in Alexnet, the expected results were presented in the Jetson TX2.

引用

页码：447 / 458

页数：12

共 26 条

[21] Sermanet P., 2014, COMPUT VIS PATTERN R
[22] Shamsuddin S., 2012, 2012 IEEE 8th International Colloquium on Signal Processing & its Applications, P188, DOI 10.1109/CSPA.2012.6194716
[23] Wang D, 2017, 2017 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY (ICFPT), P279, DOI 10.1109/FPT.2017.8280160
[24] Xu SY, 2018, INT CONF UNMAN AIRCR, P1336
[25] Zhang C., 2015, P 2015 ACM SIGDA INT, P161, DOI [10.1145/2684746.2689060, DOI 10.1145/2684746.2689060]
[26] Caffeine: Toward Uniformed Representation and Acceleration for Deep Convolutional Neural Networks
Zhang, Chen
Sun, Guangyu
Fang, Zhenman
Zhou, Peipei
Pan, Peichen
Cong, Jason
[J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2019, 38 (11) : 2072 - 2085

← 1 2 3 →