Speech recognition system for a service robot - a performance evaluation

被引:0
|
作者
Alibegovic, Besim [1 ]
Prljaca, Naser [1 ]
Kimmel, Melanie [2 ]
Schultalbers, Matthias [2 ]
机构
[1] Univ Tuzla, Fac Elect Engn, Tuzla, Bosnia & Herceg
[2] IAV GmbH, Berlin, Germany
关键词
Speech recognition; ASR; WER; Kaldi; DeepSpeech; IBM Watson; Microsoft Azure; Google Cloud;
D O I
10.1109/icarcv50220.2020.9305342
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this work we adapt and evaluate different solutions for automatic speech recognition (ASR) to be used as an HMI for the assistant robot. Two on-device solutions: Kaldi (DNN-HMM) and Mozilla's DeepSpeech (end-to-end), and three internet service APIs: IBM Watson, Microsoft Azure and Google Speech to Text are evaluated. The systems are adapted to the domain of robot commands and evaluated on a set of expected inputs. As the goal is to retain the ability to recognise general language, the systems are also evaluated on out of domain data.
引用
收藏
页码:1171 / 1176
页数:6
相关论文
共 50 条
  • [1] HMM and BPNN based Speech Recognition System for Home Service Robot
    Liu, Chih-Yin
    Hung, Tzu-Hsin
    Cheng, Kai-Chung
    Li, Tzuu-Hseng S.
    2013 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND INTELLIGENT SYSTEMS (ARIS), 2013, : 38 - 43
  • [2] Parameter Tuning of Robot Audition Using Speech Recognition System as Evaluation Function
    Matsumoto, Mitsuharu
    2015 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC), 2015, : 275 - 278
  • [3] Assistive Robot for Speech Semantic Recognition System
    Mohamad, Siti Nur Ateeqa
    Isa, Khalid
    PROCEEDINGS OF THE 2018 7TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING (ICCCE), 2018, : 50 - 55
  • [4] Embedded speech recognition system for intelligent robot
    Hong, Qingyang
    Zhang, Caihong
    Chen, Xiaoyang
    Chen, Yan
    14TH INTERNATIONAL CONFERENCE ON MECHATRONICS AND MACHINE VISION IN PRACTICE 2007, PROCEEDINGS, 2007, : 35 - +
  • [5] Performance evaluation of Hindi speech recognition system using optimized filterbanks
    Dua, Mohit
    Aggarwal, Rajesh Kumar
    Biswas, Mantosh
    ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2018, 21 (03): : 389 - 398
  • [6] Prosodic Events Recognition in Evaluation of Speech-Synthesis System Performance
    Mihelic, France
    Vesnicer, Bostjan
    Zibert, Janes
    Noeth, Elmar
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2008, 5246 : 419 - +
  • [7] Automatic Robot Processing Using Speech Recognition System
    Elavarasi, S.
    Suseendran, G.
    DATA MANAGEMENT, ANALYTICS AND INNOVATION, ICDMAI 2019, VOL 1, 2020, 1042 : 185 - 195
  • [8] Improving Speech Emotion Recognition System for a Social Robot with Speaker Recognition
    Juszkiewicz, Lukasz
    2014 19TH INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN AUTOMATION AND ROBOTICS (MMAR), 2014, : 921 - 925
  • [9] Service robot system with ability of affective and speech interaction
    School of Information Engineering, University of Science and Technology, Beijing 100083, China
    不详
    J. Comput. Inf. Syst., 2006, 1 (133-138):
  • [10] System-level modeling and performance evaluation of speech recognition system based on SystemC
    Liu, Jin-Wei
    Huang, Zhang-Qin
    Hou, Yi-Bin
    Huo, Si-Jia
    Wang, Jin-Jia
    Beijing Gongye Daxue Xuebao / Journal of Beijing University of Technology, 2010, 36 (01): : 117 - 123