Robust Keyword Spotting with Rapidly Adapting Point Process Models

被引：0

作者：

Jansen, Aren ^{[1
]}

Niyogi, Partha ^{[1
]}

机构：

[1] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA

来源：

INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009年

关键词：

keyword spotting; point process model; noise adaptation;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we investigate the noise robustness properties of frame-based and sparse point process-based models for spotting keywords in continuous speech. We introduce a new strategy to improve point process model (PPM) robustness by adapting low-level feature detector thresholds to preserve background firing rates in the presence of noise. We find that this unsupervised approach can significantly outperform fully supervised maximum likelihood linear regression (MLLR) adaptation of an equivalent keyword-filler HMM system in the presence of additive white and pink noise. Moreover, we find that the sparsity of PPMs introduces an inherent resilience to non-stationary babble noise not exhibited by the frame-based HMM system. Finally, we demonstrate that our approach requires less adaptation data than MLLR, permitting rapid online adaptation.

引用

页码：2727 / 2730

页数：4

共 50 条

[21] Robust Keyword Spotting for Noisy Environments by Leveraging Speech Enhancement and Speech Presence Probability
Yang, Chouchang
Saidutta, Yashas Malur
Srinivasa, Rakshith Sharma
Lee, Ching-Hua
Shen, Yilin
Jin, Hongxia
INTERSPEECH 2023, 2023, : 1638 - 1642
[22] Inverting the Point Process Model for Fast Phonetic Keyword Search
Kintzley, Keith
Jansen, Aren
Church, Kenneth
Hermansky, Hynek
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2437 - 2440
[23] Training Keyword Spotting Models on Non-IID Data with Federated Learning
Hard, Andrew
Partridge, Kurt
Nguyen, Cameron
Subrahmanya, Niranjan
Shah, Aishanee
Zhu, Pai
Moreno, Ignacio Lopez
Mathews, Rajiv
INTERSPEECH 2020, 2020, : 4343 - 4347
[24] Data-Adaptive Single-Pole Filtering of Magnitude Spectra for Robust Keyword Spotting
Jayant Kumar Rout
Gayadhar Pradhan
Circuits, Systems, and Signal Processing, 2022, 41 : 3023 - 3039
[25] Data-Adaptive Single-Pole Filtering of Magnitude Spectra for Robust Keyword Spotting
Rout, Jayant Kumar
Pradhan, Gayadhar
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (05) : 3023 - 3039
[26] Sub 8-Bit Quantization of Streaming Keyword Spotting Models for Embedded Chipsets
Zeng, Lu
Parthasarathi, Sree Hari Krishnan
Liu, Yuzong
Escott, Alex
Cheekatmalla, Santosh
Strom, Nikko
Vitaladevuni, Shiv
TEXT, SPEECH, AND DIALOGUE (TSD 2022), 2022, 13502 : 364 - 376
[27] STREAMING SMALL-FOOTPRINT KEYWORD SPOTTING USING SEQUENCE-TO-SEQUENCE MODELS
He, Yanzhang
Prabhavalkar, Rohit
Rao, Kanishka
Li, Wei
Bakhtin, Anton
McGraw, Ian
2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 474 - 481
[28] CONTEXT-DEPENDENT POINT PROCESS MODELS FOR KEYWORD SEARCH AND DETECTION-BASED ASR
Liu, Chunxi
Jansen, Aren
Khudanpur, Sanjeev
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6025 - 6029
[29] Multi-stream LSTM-HMM decoding and histogram equalization for noise robust keyword spotting
Woellmer, Martin
Marchi, Erik
Squartini, Stefano
Schuller, Bjoern
COGNITIVE NEURODYNAMICS, 2011, 5 (03) : 253 - 264
[30] TE-KWS: Text-Informed Speech Enhancement for Noise-Robust Keyword Spotting
Liu, Dong
Mao, Qirong
Gao, Lijian
Ren, Qinghua
Chen, Zhenghan
Dong, Ming
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 601 - 610

← 1 2 3 4 5 →