Improvement of Visual Perception in Humanoid Robots Using Heterogeneous Architectures for Autonomous Applications

Cited by: 0
Authors
Guajo, Joaquin [1]
Alzate Anzola, Cristian [1]
Betancur, Daniel [2]
Castano-Londono, Luis [1]
Marquez-Viloria, David [1]
Affiliations
[1] Inst Tecnol Metropolitano ITM, Dept Elect & Telecommun Engn, Medellin, Colombia
[2] Inst Univ Envigado, Syst & Comp Sci Res Grp, Medellin, Colombia
Source
APPLIED COMPUTER SCIENCES IN ENGINEERING, WEA 2021 | 2021 / Vol. 1431
Keywords
CNN; Field programmable gate array (FPGA); System-on-a-Chip (SoC); High-level synthesis (HLS); Humanoid robot;
DOI
10.1007/978-3-030-86702-7_38
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Humanoid robots are used in a variety of tasks, such as emotion recognition for human-robot interaction (HRI). Despite their capabilities, these robots have a sequential computing system that limits the execution of computationally expensive algorithms such as Convolutional Neural Networks (CNNs), which have shown good performance in recognition tasks. This limitation reduces their performance in HRI applications. Alternatives to sequential computing units include field-programmable gate arrays (FPGAs) and Graphics Processing Units (GPUs), which offer a high degree of parallelism, high performance, and low power consumption. In this paper, we propose a visual perception enhancement system for humanoid robots that uses FPGA- or GPU-based embedded systems running a CNN, while maintaining autonomy through an external computational system added to the robot structure. Our case study is the humanoid robot NAO; however, the work can be replicated on other robots such as Pepper and Robotis OP3. The development boards used were the Xilinx Ultra96 FPGA, the Intel Cyclone V SoC FPGA, and the Nvidia Jetson TX2 GPU. Nevertheless, our design allows the integration of other heterogeneous architectures with high parallelism and low power consumption. The Tinier-YOLO, AlexNet, and Inception-V1 CNNs were executed, and real-time results were obtained on the FPGA and GPU boards, while for AlexNet the expected results were obtained on the Jetson TX2.
Pages: 447-458
Page count: 12