Robot ego-noise suppression with labanotation-template subtraction

Cited by: 2
Authors
Jaroslavceva, Jekaterina [1]
Wake, Naoki [1]
Sasabuchi, Kazuhiro [1]
Ikeuchi, Katsushi [1]
Affiliations
[1] Microsoft, Appl Robot Res, Redmond, WA 98052 USA
Keywords
ego-noise; labanotation; automatic speech recognition; human-robot interaction
DOI
10.1002/tee.23523
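Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809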
Abstract
In this study, we aim to improve automatic speech recognition (ASR) accuracy in the presence of robot ego-noise, with the goal of better human-robot interaction. Several noise reduction methods have been proposed to increase ASR accuracy or signal-to-noise ratio (SNR) by predicting ego-noise through short-time motion-template subtraction or a neural network, but these methods perform poorly in some practical use cases, such as attenuating long-term motion-associated ego-noise. Building on the motion-template subtraction method, we address the problem of creating ego-noise templates for a wide variety of robot motions. To represent robot motions, we employ a dance notation known as Labanotation. The rationales behind our approach are: (i) Labanotation allows an infinite variety of motion patterns to be quantized into a finite number of Labanotation combinations; (ii) Labanotation-based motion description is hardware-independent; and (iii) long-time noise templates are easier to localize in a speech-with-noise signal than short-time templates. The effectiveness of the Labanotation-template subtraction (LTS) method was tested with five commercial ASRs in terms of ASR accuracy, SNR, and source-to-distortion ratio. We show that LTS achieves reasonable performance, comparable to that of the other methods. The contributions of this study are (i) proposing the use of Labanotation to collect noise templates in a reasonable manner, and (ii) demonstrating the practical effectiveness of LTS, along with examples of Labanotations for household actions. © 2021 Institute of Electrical Engineers of Japan. Published by Wiley Periodicals LLC.
Pages: 407-415
Number of pages: 9