Robust Keyword Spotting with Rapidly Adapting Point Process Models

被引:0
|
作者
Jansen, Aren [1 ]
Niyogi, Partha [1 ]
机构
[1] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA
来源
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009年
关键词
keyword spotting; point process model; noise adaptation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we investigate the noise robustness properties of frame-based and sparse point process-based models for spotting keywords in continuous speech. We introduce a new strategy to improve point process model (PPM) robustness by adapting low-level feature detector thresholds to preserve background firing rates in the presence of noise. We find that this unsupervised approach can significantly outperform fully supervised maximum likelihood linear regression (MLLR) adaptation of an equivalent keyword-filler HMM system in the presence of additive white and pink noise. Moreover, we find that the sparsity of PPMs introduces an inherent resilience to non-stationary babble noise not exhibited by the frame-based HMM system. Finally, we demonstrate that our approach requires less adaptation data than MLLR, permitting rapid online adaptation.
引用
收藏
页码:2727 / 2730
页数:4
相关论文
共 50 条
  • [1] ROBUST REPRESENTATIONS FOR KEYWORD SPOTTING SYSTEMS
    Smyth, Aidan
    Lyons, Niall
    Wada, Ted
    Zopf, Robert
    Pandey, Ashutosh
    Santra, Avik
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3210 - 3215
  • [2] Improved Keyword Spotting based on Keyword/Garbage Models
    Chen, Qiyu
    Zhang, Weibin
    Xu, Xiangmin
    Xing, Xiaofen
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [3] Point Process Models for Spotting Keywords in Continuous Speech
    Jansen, Aren
    Niyogi, Partha
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (08): : 1457 - 1470
  • [4] Prototypical Knowledge Distillation for Noise Robust Keyword Spotting
    Kim, Donghyeon
    Kim, Gwantae
    Lee, Bokyeung
    Ko, Hanseok
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2298 - 2302
  • [5] Multi-scale Convolution for Robust Keyword Spotting
    Yang, Chen
    Wen, Xue
    Song, Liming
    INTERSPEECH 2020, 2020, : 2577 - 2581
  • [6] Robust and efficient keyword spotting using a bidirectional attention LSTM
    Swain O.P.
    Hemanth H.
    Saran P.
    Kothandaraman M.
    Ravi L.
    Sailor H.
    Rajesh K.S.
    International Journal of Speech Technology, 2023, 26 (04) : 919 - 931
  • [7] Noisy student-teacher training for robust keyword spotting
    Park, Hyun-Jin
    Zhu, Pai
    Moreno, Ignacio Lopez
    Subrahmanya, Niranjan
    INTERSPEECH 2021, 2021, : 331 - 335
  • [8] Depthwise-Separable Residual Capsule for Robust Keyword Spotting
    Huang, Xianghong
    Yang, Qun
    Liu, Shaohan
    MULTIMEDIA MODELING, MMM 2022, PT II, 2022, 13142 : 194 - 204
  • [9] TRAINABLE FRONTEND FOR ROBUST AND FAR-FIELD KEYWORD SPOTTING
    Wang, Yuxuan
    Getreuer, Pascal
    Hughes, Thad
    Lyon, Richard F.
    Saurous, Rif A.
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5670 - 5674
  • [10] Discriminatory and Orthogonal Feature Learning for Noise Robust Keyword Spotting
    Kim, Donghyeon
    Ko, Kyungdeuk
    Han, David K.
    Ko, Hanseok
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1913 - 1917