Robust Keyword Spotting via Recycle-Pooling for Mobile Game

被引:0
|
作者
An, Shounan [1 ]
Kim, Youngsoo [1 ]
Xu, Hu [1 ]
Lee, Jinwoo [1 ]
Lee, Myungwoo [2 ]
Oh, Insoo [3 ]
机构
[1] Netmarble, NARC, Game Dev AI Team, Seoul, South Korea
[2] Netmarble, NARC, Game Contents AI Team, Seoul, South Korea
[3] Netmarble, NARC, Magellan Div, Seoul, South Korea
来源
INTERSPEECH 2019 | 2019年
关键词
keyword spotting; recycle-pooling; convolutional neural network; mobile games;
D O I
暂无
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
We present an effective method to solve a small-footprint keyword spotting (KWS) task via deep neural network for mobile game. Our goal is to improve the accuracy of KWS in various environments. To this end, we propose a new neural network layer named recycle-pooling. Extensive experiments indicate that our recycle-pooling based convolutional neural network (RP-CNN) indeed improves the performance of KWS in both clean and noisy data for mobile game. We will perform live demonstration of RP-CNN based KWS integrated into a full-sized, production-quality mobile game A3: Still Alive, which is one of the major games from Netmarble this year and will be available on market soon.
引用
收藏
页码:3661 / 3662
页数:2
相关论文
共 39 条
  • [21] Robust Keyword Spotting for Noisy Environments by Leveraging Speech Enhancement and Speech Presence Probability
    Yang, Chouchang
    Saidutta, Yashas Malur
    Srinivasa, Rakshith Sharma
    Lee, Ching-Hua
    Shen, Yilin
    Jin, Hongxia
    INTERSPEECH 2023, 2023, : 1638 - 1642
  • [22] Data-Adaptive Single-Pole Filtering of Magnitude Spectra for Robust Keyword Spotting
    Jayant Kumar Rout
    Gayadhar Pradhan
    Circuits, Systems, and Signal Processing, 2022, 41 : 3023 - 3039
  • [23] Data-Adaptive Single-Pole Filtering of Magnitude Spectra for Robust Keyword Spotting
    Rout, Jayant Kumar
    Pradhan, Gayadhar
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (05) : 3023 - 3039
  • [24] Audio-Visual Multi-person Keyword Spotting via Hybrid Fusion
    Su, Yuxin
    Miao, Ziling
    Liu, Hong
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT II, 2022, 13605 : 327 - 338
  • [25] MAX-POOLING LOSS TRAINING OF LONG SHORT-TERM MEMORY NETWORKS FOR SMALL-FOOTPRINT KEYWORD SPOTTING
    Sun, Ming
    Raju, Anirudh
    Tucker, George
    Panchapagesan, Sankaran
    Fu, Gengshen
    Mandal, Arindam
    Matsoukas, Spyros
    Strom, Nikko
    Vitaladevuni, Shiv
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 474 - 480
  • [26] Multi-stream LSTM-HMM decoding and histogram equalization for noise robust keyword spotting
    Woellmer, Martin
    Marchi, Erik
    Squartini, Stefano
    Schuller, Bjoern
    COGNITIVE NEURODYNAMICS, 2011, 5 (03) : 253 - 264
  • [27] TE-KWS: Text-Informed Speech Enhancement for Noise-Robust Keyword Spotting
    Liu, Dong
    Mao, Qirong
    Gao, Lijian
    Ren, Qinghua
    Chen, Zhenghan
    Dong, Ming
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 601 - 610
  • [28] Portable Keyword Spotting and Sound Source Detection System Design on Mobile Robot with Mini Microphone Array
    Andra, Muhammad Bagus
    Usagawa, Tsuyoshi
    2020 6TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2020, : 170 - 174
  • [29] FOCAL LOSS AND DOUBLE-EDGE-TRIGGERED DETECTOR FOR ROBUST SMALL-FOOTPRINT KEYWORD SPOTTING
    Liu, Bin
    Nie, Shuai
    Zhang, Yaping
    Liang, Shan
    Yang, Zhanlei
    Liu, Wenju
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6361 - 6365
  • [30] Multi-stream LSTM-HMM decoding and histogram equalization for noise robust keyword spotting
    Martin Wöllmer
    Erik Marchi
    Stefano Squartini
    Björn Schuller
    Cognitive Neurodynamics, 2011, 5 : 253 - 264