A Real-Time Sound Source Localization System for Robotic Vacuum Cleaners With a Microphone Array

Cited by: 0
Authors
Kim, Jun Hyung [1]
Kim, Taehan [1]
Kim, Seokhyun [2]
Song, Ju-Man [2]
Park, Yongjin [2]
Kim, Minook [2]
Son, Jungkwan [2]
Jeong, Jimann [2]
Park, Hyung-Min [1]
Affiliations
[1] Sogang Univ, Dept Elect Engn, Seoul 04107, South Korea
[2] LG Elect CTO, Seoul 06772, South Korea
Keywords
Speech enhancement; Robots; Real-time systems; Location awareness; Computational modeling; Correlation; Vacuum systems; Sensors; Direction-of-arrival estimation; Vectors; Deep neural networks (DNNs); ego-noise reduction; microphone array; real-time speech enhancement; sound source localization (SSL); TRACKING
DOI
10.1109/JSEN.2024.3500007
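Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809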
Abstract
With the progress of artificial intelligence (AI) technology, home appliances are becoming more advanced to enhance our quality of life. Many smart devices support speech interfaces, including voice commands and user location tracking. However, robotic vacuum cleaners generate strong ego-noise that distorts microphone signals, making it difficult to estimate the user's location. To solve this problem, we propose a real-time sound source localization (SSL) system for a robotic vacuum cleaner equipped with a microphone array. The system consists of speech enhancement, voice activity detection (VAD), and SSL modules. The speech enhancement module uses TRU-Net-Light, which requires less computation than the tiny recurrent U-net (TRU-Net) while achieving similar speech enhancement performance. TRU-Net-Light uses fewer channels to shrink the model and applies frequency-axis multihead self-attention to boost representational capacity. The finite state machine-based VAD detects voice-active periods from the output of the speech enhancement module. Furthermore, we present a mask-weighted difference correlation vector and singular value decomposition (SVD) with the smoothed coherence transform (DSVD-SCOT), which achieves robust localization performance in severely noisy environments. On the robotic vacuum cleaner used in the experiments, the localization accuracy of the SSL system was 97.9% and 84.0% at signal-to-noise ratios (SNRs) of -3 and -8 dB, respectively. The proposed system ran in real time, with a real-time factor (RTF) of 0.378, on a single Kryo 585 Silver core of the RB5 platform. A demo of the proposed system is available at https://youtu.be/3d3Cr-cs9aY.
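The abstract does not give the states or thresholds of the finite state machine-based VAD, so the following is only a minimal sketch of the general idea, not the authors' design. It assumes a hypothetical per-frame speech-presence score in [0, 1] (for example, an average of the enhancement module's output mask) and uses two states with hysteresis. The function name `fsm_vad` and all parameter names and default values are illustrative assumptions.

```python
from enum import Enum


class State(Enum):
    SILENCE = 0
    SPEECH = 1


def fsm_vad(frame_scores, on_thresh=0.6, off_thresh=0.4, min_on=3, hangover=5):
    """Return one boolean per frame: True while the FSM is in SPEECH.

    frame_scores: hypothetical per-frame speech-presence scores in [0, 1].
    on_thresh / off_thresh: assumed hysteresis thresholds.
    min_on: consecutive high-score frames needed to enter SPEECH.
    hangover: low-score frames tolerated before leaving SPEECH.
    """
    state = State.SILENCE
    run = 0  # consecutive frames that support a state change
    decisions = []
    for score in frame_scores:
        if state is State.SILENCE:
            run = run + 1 if score >= on_thresh else 0
            if run >= min_on:
                state, run = State.SPEECH, 0
        else:
            run = run + 1 if score < off_thresh else 0
            if run > hangover:
                state, run = State.SILENCE, 0
        decisions.append(state is State.SPEECH)
    return decisions
```

In a pipeline like the one described, only frames marked True would be passed on to the localization stage; the onset and hangover counters trade detection latency against robustness to short bursts of residual noise.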
Pages: 1243-1252
Number of pages: 10