An On-device Robust Sound Recognition System for Real-time Context Awareness of Robots

被引:0
|
作者
Song, Ju-man [1 ]
Kim, Changmin [1 ]
Son, Jungkwan [1 ]
机构
[1] LG Elect, Adv Robot Lab, Seoul, South Korea
关键词
D O I
10.1109/RO-MAN60168.2024.10731337
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper suggests an on-device robust sound recognition system for robots in real-time. The proposed system is designed to enable the robot to detect a variety of sound events in a variety of locations, including noisy and reverberant sound environments. To use suggested system on target robots, two VGGish models are trained on sever-side and the pre-trained models infer using an audio topic from a on-device real-time buffer handling system. The buffer handling system and the training system of deep learning model are designed to get almost silmilar input audio stream with each normalization system. To get robust performance in various environments, we use log-mel feature for general environments and per-chennal energy normalization for noisy and reverberant environments. Each feature is switched and used in real time on the robot depending on the sound environment mode. Several experimental results demonstrate the robust performance of the proposed real-time robust sound recognition system on a target robot.
引用
收藏
页码:2212 / 2218
页数:7
相关论文
共 50 条
  • [31] Real-Time Vegetables Recognition System based on Deep Learning Network for Agricultural Robots
    Zheng, Yang-yang
    Kong, Jian-lei
    Jin, Xue-bo
    Su, Ting-li
    Nie, Ming-jun
    Bai, Yu-ting
    2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 2223 - 2228
  • [32] Real-Time On-Device Continual Learning Based on a Combined Nearest Class Mean and Replay Method for Smartphone Gesture Recognition
    Park, Heon-Sung
    Sung, Min-Kyung
    Kim, Dae-Won
    Lee, Jaesung
    SENSORS, 2025, 25 (02)
  • [33] Multimodal Personality Prediction: A Real-Time Recognition System for Social Robots with Data Acquisition
    Bhin, Hyeonuk
    Lim, Yoonseob
    Choi, Jongsuk
    2024 21ST INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS, UR 2024, 2024, : 673 - 676
  • [34] PrimeEye: A real-time face detection and recognition system robust to illumination changes
    Choi, J
    Lee, S
    Lee, C
    Yi, J
    AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2001, 2091 : 360 - 365
  • [35] A Real-time Robust Facial Expression Recognition System using HOG Features
    Kumar, Pranav
    Happy, S. L.
    Routray, Aurobinda
    2016 INTERNATIONAL CONFERENCE ON COMPUTING, ANALYTICS AND SECURITY TRENDS (CAST), 2016, : 289 - 293
  • [36] Robust place recognition based on omnidirectional vision and real-time local visual features for mobile robots
    Lu, Huimin
    Li, Xun
    Zhang, Hui
    Zheng, Zhiqiang
    ADVANCED ROBOTICS, 2013, 27 (18) : 1439 - 1453
  • [37] Real-time Emotions Recognition System
    Silva, Vinicius
    Soares, Filomena
    Esteves, Joao S.
    Figueiredo, Joana
    Leao, Celina P.
    Santos, Cristina
    Pereira, Ana Paula
    2016 8TH INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS AND WORKSHOPS (ICUMT), 2016, : 201 - 206
  • [38] Real-time Social Touch Gesture Recognition for Sensate Robots
    Knight, Heather
    Toscano, Robert
    Stiehl, Walter D.
    Chang, Angela
    Wang, Yi
    Breazeal, Cynthia
    2009 IEEE-RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2009, : 3715 - 3720
  • [39] Wearable schizophrenia treatment with real-time affective context awareness
    Tarnanas, IA
    Kikis, VM
    MEDICINE MEETS VIRTUAL REALITY 11: NEXTMED: HEALTH HORIZON, 2003, 94 : 357 - 359
  • [40] Real-time face tracking and recognition using the mobile robots
    Lee, Min-Fan Ricky
    Li, Ying-Chi
    Chien, Ming-Yen
    ADVANCED ROBOTICS, 2015, 29 (03) : 187 - 208