Robust Keyword Spotting with Rapidly Adapting Point Process Models

被引:0
|
作者
Jansen, Aren [1 ]
Niyogi, Partha [1 ]
机构
[1] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA
来源
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009年
关键词
keyword spotting; point process model; noise adaptation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we investigate the noise robustness properties of frame-based and sparse point process-based models for spotting keywords in continuous speech. We introduce a new strategy to improve point process model (PPM) robustness by adapting low-level feature detector thresholds to preserve background firing rates in the presence of noise. We find that this unsupervised approach can significantly outperform fully supervised maximum likelihood linear regression (MLLR) adaptation of an equivalent keyword-filler HMM system in the presence of additive white and pink noise. Moreover, we find that the sparsity of PPMs introduces an inherent resilience to non-stationary babble noise not exhibited by the frame-based HMM system. Finally, we demonstrate that our approach requires less adaptation data than MLLR, permitting rapid online adaptation.
引用
收藏
页码:2727 / 2730
页数:4
相关论文
共 50 条
  • [21] Robust Keyword Spotting for Noisy Environments by Leveraging Speech Enhancement and Speech Presence Probability
    Yang, Chouchang
    Saidutta, Yashas Malur
    Srinivasa, Rakshith Sharma
    Lee, Ching-Hua
    Shen, Yilin
    Jin, Hongxia
    INTERSPEECH 2023, 2023, : 1638 - 1642
  • [22] Inverting the Point Process Model for Fast Phonetic Keyword Search
    Kintzley, Keith
    Jansen, Aren
    Church, Kenneth
    Hermansky, Hynek
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2437 - 2440
  • [23] Training Keyword Spotting Models on Non-IID Data with Federated Learning
    Hard, Andrew
    Partridge, Kurt
    Nguyen, Cameron
    Subrahmanya, Niranjan
    Shah, Aishanee
    Zhu, Pai
    Moreno, Ignacio Lopez
    Mathews, Rajiv
    INTERSPEECH 2020, 2020, : 4343 - 4347
  • [24] Data-Adaptive Single-Pole Filtering of Magnitude Spectra for Robust Keyword Spotting
    Jayant Kumar Rout
    Gayadhar Pradhan
    Circuits, Systems, and Signal Processing, 2022, 41 : 3023 - 3039
  • [25] Data-Adaptive Single-Pole Filtering of Magnitude Spectra for Robust Keyword Spotting
    Rout, Jayant Kumar
    Pradhan, Gayadhar
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (05) : 3023 - 3039
  • [26] Sub 8-Bit Quantization of Streaming Keyword Spotting Models for Embedded Chipsets
    Zeng, Lu
    Parthasarathi, Sree Hari Krishnan
    Liu, Yuzong
    Escott, Alex
    Cheekatmalla, Santosh
    Strom, Nikko
    Vitaladevuni, Shiv
    TEXT, SPEECH, AND DIALOGUE (TSD 2022), 2022, 13502 : 364 - 376
  • [27] STREAMING SMALL-FOOTPRINT KEYWORD SPOTTING USING SEQUENCE-TO-SEQUENCE MODELS
    He, Yanzhang
    Prabhavalkar, Rohit
    Rao, Kanishka
    Li, Wei
    Bakhtin, Anton
    McGraw, Ian
    2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 474 - 481
  • [28] CONTEXT-DEPENDENT POINT PROCESS MODELS FOR KEYWORD SEARCH AND DETECTION-BASED ASR
    Liu, Chunxi
    Jansen, Aren
    Khudanpur, Sanjeev
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6025 - 6029
  • [29] Multi-stream LSTM-HMM decoding and histogram equalization for noise robust keyword spotting
    Woellmer, Martin
    Marchi, Erik
    Squartini, Stefano
    Schuller, Bjoern
    COGNITIVE NEURODYNAMICS, 2011, 5 (03) : 253 - 264
  • [30] TE-KWS: Text-Informed Speech Enhancement for Noise-Robust Keyword Spotting
    Liu, Dong
    Mao, Qirong
    Gao, Lijian
    Ren, Qinghua
    Chen, Zhenghan
    Dong, Ming
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 601 - 610