Robust Keyword Spotting via Recycle-Pooling for Mobile Game

被引:0
|
作者
An, Shounan [1 ]
Kim, Youngsoo [1 ]
Xu, Hu [1 ]
Lee, Jinwoo [1 ]
Lee, Myungwoo [2 ]
Oh, Insoo [3 ]
机构
[1] Netmarble, NARC, Game Dev AI Team, Seoul, South Korea
[2] Netmarble, NARC, Game Contents AI Team, Seoul, South Korea
[3] Netmarble, NARC, Magellan Div, Seoul, South Korea
来源
INTERSPEECH 2019 | 2019年
关键词
keyword spotting; recycle-pooling; convolutional neural network; mobile games;
D O I
暂无
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
We present an effective method to solve a small-footprint keyword spotting (KWS) task via deep neural network for mobile game. Our goal is to improve the accuracy of KWS in various environments. To this end, we propose a new neural network layer named recycle-pooling. Extensive experiments indicate that our recycle-pooling based convolutional neural network (RP-CNN) indeed improves the performance of KWS in both clean and noisy data for mobile game. We will perform live demonstration of RP-CNN based KWS integrated into a full-sized, production-quality mobile game A3: Still Alive, which is one of the major games from Netmarble this year and will be available on market soon.
引用
收藏
页码:3661 / 3662
页数:2
相关论文
共 39 条
  • [31] Production federated keyword spotting via distillation, filtering, and joint federated-centralized training
    Hard, Andrew
    Partridge, Kurt
    Chen, Neng
    Augenstein, Sean
    Shah, Aishanee
    Park, Hyun Jin
    Park, Alex
    Ng, Sara
    Nguyen, Jessica
    Moreno, Ignacio Lopez
    Mathews, Rajiv
    Beaufays, Francoise
    INTERSPEECH 2022, 2022, : 76 - 80
  • [32] Multi-class AUC Optimization for Robust Small-footprint Keyword Spotting with Limited Training Data
    Xu, Menglong
    Li, Shengqiang
    Liang, Chengdong
    Zhang, Xiao-Lei
    INTERSPEECH 2022, 2022, : 3278 - 3282
  • [33] DCCRN-KWS: An Audio Bias Based Model for Noise Robust Small-Footprint Keyword Spotting
    Lv, Shubo
    Wang, Xiong
    Sun, Sining
    Ma, Long
    Xie, Lei
    INTERSPEECH 2023, 2023, : 929 - 933
  • [34] Robust Small-Footprint Keyword Spotting Using Sequence-To-Sequence Model With Connectionist Temporal Classifier
    Xuan, Xiaoguang
    Wang, Mingjiang
    Zhang, Xin
    Sun, Fengjiao
    2019 2ND IEEE INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP), 2019, : 400 - 404
  • [35] On-Device Constrained Self-Supervised Speech Representation Learning for Keyword Spotting via Knowledge Distillation
    Yang, Gene-Ping
    Gu, Yue
    Tang, Qingming
    Du, Dongsu
    Liu, Yuzong
    INTERSPEECH 2023, 2023, : 1623 - 1627
  • [36] AUTOMATIC GAIN CONTROL AND MULTI-STYLE TRAINING FOR ROBUST SMALL-FOOTPRINT KEYWORD SPOTTING WITH DEEP NEURAL NETWORKS
    Prabhavalkar, Rohit
    Alvarez, Raziel
    Parada, Carolina
    Nakkiran, Preetum
    Sainath, Tara N.
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4704 - 4708
  • [37] VIC-KD: VARIANCE-INVARIANCE-COVARIANCE KNOWLEDGE DISTILLATION TO MAKE KEYWORD SPOTTING MORE ROBUST AGAINST ADVERSARIAL ATTACKS
    Guimaraes, Heitor R.
    Pimentel, Arthur
    Avila, Anderson
    Falk, Tiago H.
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2024), 2024, : 12196 - 12200
  • [38] Multi-Task Network for Noise-Robust Keyword Spotting and Speaker Verification using CTC-based Soft VAD and Global Query Attention
    Jung, Myunghun
    Jung, Youngmoon
    Goo, Jahyun
    Kim, Hoirin
    INTERSPEECH 2020, 2020, : 931 - 935
  • [39] THE PROJECT LABOUR MARKET IN TOUCH: NEW NON-ROUTINE SKILLS VIA MOBILE GAME-BASED LEARNING
    Putz, Thomas
    LEVERAGING TECHNOLOGY FOR LEARNING, VOL II, 2012, : 367 - 372