An On-device Robust Sound Recognition System for Real-time Context Awareness of Robots

Citations: 0
Authors
Song, Ju-man [1]
Kim, Changmin [1]
Son, Jungkwan [1]
Affiliation
[1] LG Elect, Adv Robot Lab, Seoul, South Korea
DOI
10.1109/RO-MAN60168.2024.10731337
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper proposes an on-device, robust sound recognition system that runs on robots in real time. The system is designed to let the robot detect a variety of sound events in a variety of locations, including noisy and reverberant environments. To deploy the system on target robots, two VGGish models are trained on the server side, and the pre-trained models run inference on an audio topic supplied by an on-device real-time buffer handling system. The buffer handling system and the model training system, each with its own normalization system, are designed to receive nearly identical input audio streams. For robust performance across environments, we use log-mel features for general environments and per-channel energy normalization (PCEN) for noisy and reverberant environments. The robot switches between the two features in real time according to the sound environment mode. Experimental results demonstrate the robust performance of the proposed real-time sound recognition system on a target robot.
Pages: 2212-2218
Page count: 7
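
The feature switching described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the mode labels "general" and "noisy", and the PCEN parameter values are assumptions chosen for illustration. The input is assumed to be a mel-band energy spectrogram of shape (frames, bands). PCEN is written in its commonly published form, M(t, f) = (1 - s) M(t-1, f) + s E(t, f) and PCEN(t, f) = (E(t, f) / (eps + M(t, f))^alpha + delta)^r - delta^r.

```python
import numpy as np


def log_mel(mel_energy, eps=1e-6):
    """Log-compress a mel energy spectrogram of shape (frames, bands)."""
    mel_energy = np.asarray(mel_energy, dtype=np.float64)
    return np.log(mel_energy + eps)


def pcen(mel_energy, s=0.025, alpha=0.98, delta=2.0, r=0.5, eps=1e-6):
    """Per-channel energy normalization of a (frames, bands) mel spectrogram.

    Parameter values are illustrative assumptions, not taken from the paper.
    """
    mel_energy = np.asarray(mel_energy, dtype=np.float64)
    # First-order IIR smoother along time, computed independently per band.
    smoothed = np.empty_like(mel_energy)
    smoothed[0] = mel_energy[0]
    for t in range(1, mel_energy.shape[0]):
        smoothed[t] = (1.0 - s) * smoothed[t - 1] + s * mel_energy[t]
    # Adaptive gain control followed by root compression.
    gain = (eps + smoothed) ** (-alpha)
    return (mel_energy * gain + delta) ** r - delta ** r


def extract_features(mel_energy, mode):
    """Pick the feature type from a hypothetical sound environment mode."""
    if mode == "general":
        return log_mel(mel_energy)
    if mode == "noisy":
        return pcen(mel_energy)
    raise ValueError(f"unknown sound environment mode: {mode!r}")
```

Because the smoother divides each band by its own recent energy, PCEN is less sensitive to overall loudness and slowly varying background noise than plain log compression, which is one plausible reason to reserve it for the noisy and reverberant mode.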