FAST LATTICE-FREE KEYWORD FILTERING FOR ACCELERATED SPOKEN TERM DETECTION

被引:0
|
作者
Wintrode, Jonathan [1 ]
Wilkes, Jenny [1 ]
机构
[1] Raytheon Appl Signal Technol, Annapolis Jct, MD 20701 USA
来源
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2020年
关键词
speech recognition; keyword spotting; term detection;
D O I
10.1109/icassp40776.2020.9054221
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We present a novel set of keyword detection techniques to accelerate spoken term detection for known queries with minimal loss in accuracy. Using only ASR frame-level acoustic posteriors we can train multiple models to effectively detect non-target segments for which we need not perform full lattice decoding. We estimate phone n-gram soft counts for each segment in a single pass over the frame-level output. From this we can efficiently detect a fixed set of keywords with both linear and DNN-based classifiers. Furthermore we can train the linear classifiers on a small number of labeled examples. Experiments on the PSC and VAST English subset of NIST's 2019 OpenSAT evaluation demonstrate we can filter out half of the test audio segments while only increasing the keyword miss rate by under 3%.
引用
收藏
页码:7469 / 7473
页数:5
相关论文
共 31 条
  • [31] Robust/Fast Out-of-Vocabulary Spoken Term Detection By N-gram Index with Exact Distance Through Text/Speech Input
    Sakamoto, Nagisa
    Nakagawa, Seiichi
    2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,