Robust Keyword Spotting with Rapidly Adapting Point Process Models

被引:0
|
作者
Jansen, Aren [1 ]
Niyogi, Partha [1 ]
机构
[1] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA
来源
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009年
关键词
keyword spotting; point process model; noise adaptation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we investigate the noise robustness properties of frame-based and sparse point process-based models for spotting keywords in continuous speech. We introduce a new strategy to improve point process model (PPM) robustness by adapting low-level feature detector thresholds to preserve background firing rates in the presence of noise. We find that this unsupervised approach can significantly outperform fully supervised maximum likelihood linear regression (MLLR) adaptation of an equivalent keyword-filler HMM system in the presence of additive white and pink noise. Moreover, we find that the sparsity of PPMs introduces an inherent resilience to non-stationary babble noise not exhibited by the frame-based HMM system. Finally, we demonstrate that our approach requires less adaptation data than MLLR, permitting rapid online adaptation.
引用
收藏
页码:2727 / 2730
页数:4
相关论文
共 50 条
  • [41] VIC-KD: VARIANCE-INVARIANCE-COVARIANCE KNOWLEDGE DISTILLATION TO MAKE KEYWORD SPOTTING MORE ROBUST AGAINST ADVERSARIAL ATTACKS
    Guimaraes, Heitor R.
    Pimentel, Arthur
    Avila, Anderson
    Falk, Tiago H.
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2024), 2024, : 12196 - 12200
  • [42] WHOLE WORD DISCRIMINATIVE POINT PROCESS MODELS
    Jansen, Aren
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5180 - 5183
  • [43] Multi-Task Network for Noise-Robust Keyword Spotting and Speaker Verification using CTC-based Soft VAD and Global Query Attention
    Jung, Myunghun
    Jung, Youngmoon
    Goo, Jahyun
    Kim, Hoirin
    INTERSPEECH 2020, 2020, : 931 - 935
  • [44] Using network models to approximate spatial point-process models
    Bauch, CT
    Galvani, AP
    MATHEMATICAL BIOSCIENCES, 2003, 184 (01) : 101 - 114
  • [45] Point process models for mixed sessile forest stands
    Marie Ange Ngo Bieng
    Christian Ginisty
    François Goreaud
    Annals of Forest Science, 2011, 68 : 267 - 274
  • [46] Point process models for mixed sessile forest stands
    Ngo Bieng, Marie Ange
    Ginisty, Christian
    Goreaud, Francoise
    ANNALS OF FOREST SCIENCE, 2011, 68 (02) : 267 - 274
  • [47] Exploiting Discriminative Point Process Models for Spoken Term Detection
    Norouzian, Atta
    Jansen, Aren
    Rose, Richard
    Thomas, Samuel
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2441 - 2444
  • [48] Dependent multivariate diffusion models and related point process models of ensemble spiking neurons
    Rick L Jenison
    BMC Neuroscience, 8 (Suppl 2)
  • [49] Point process models, the dimensions of biodiversity and the importance of small-scale biotic interactions
    Mi, Xiangcheng
    Bao, Lei
    Chen, Jianhua
    Ma, Keping
    JOURNAL OF PLANT ECOLOGY, 2014, 7 (02) : 126 - 133
  • [50] Constraining spatial variability of methane ebullition seeps in thermokarst lakes using point process models
    Anthony, Katey M. Walter
    Anthony, Peter
    JOURNAL OF GEOPHYSICAL RESEARCH-BIOGEOSCIENCES, 2013, 118 (03) : 1015 - 1034