Unconstrained keyword spotting using phone lattices with application to spoken document retrieval

Cited by: 15
Authors
Foote, JT [1]
Young, SJ [1]
Jones, GJF [1]
Sparck-Jones, K [1]
Affiliation
[1] UNIV CAMBRIDGE, COMP LAB, CAMBRIDGE CB2 3QG, ENGLAND
DOI
10.1006/csla.1997.0027
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Traditional hidden Markov model (HMM) word spotting requires both explicit HMM models of each desired keyword and a computationally expensive decoding pass. For certain applications, such as audio indexing or information retrieval, conventional word spotting may be too constrained or impractically slow. This paper presents an alternative technique, where a phone lattice - representing multiple phone hypotheses - is pre-computed prior to need. Given a phone decomposition of any desired keyword, the lattice may be rapidly searched to find putative occurrences of the keyword. Though somewhat less accurate, this can be substantially faster (orders of magnitude) and more flexible (any keyword may be detected) than previous approaches. This paper presents algorithms for lattice generation and scanning, and experimental results, including comparison with conventional keyword-HMM approaches. Finally, word spotting based on phone lattice scanning is demonstrated to be effective for spoken document retrieval. (C) 1997 Academic Press Limited.
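The lattice scanning idea in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: the edge format `(start_node, end_node, phone, log_score)`, the function name `scan_lattice`, the exact-match rule, and the toy scores are all assumptions made for illustration. It only shows why a pre-computed lattice can be searched quickly for any keyword once its phone decomposition is known.

```python
from collections import defaultdict

def scan_lattice(edges, keyword_phones):
    """Return (start_node, end_node, total_score) for every lattice path
    whose phone labels spell out keyword_phones exactly.

    Hypothetical edge format: (start_node, end_node, phone, log_score).
    """
    if not keyword_phones:
        return []

    # Index edges by (start node, phone) so each step is a dictionary lookup.
    by_start = defaultdict(list)
    for start, end, phone, score in edges:
        by_start[(start, phone)].append((end, score))

    hits = []
    start_nodes = {start for start, _, _, _ in edges}
    for origin in sorted(start_nodes):
        # Frontier holds (current node, accumulated score) for partial matches.
        frontier = [(origin, 0.0)]
        for phone in keyword_phones:
            frontier = [
                (end, acc + score)
                for node, acc in frontier
                for end, score in by_start.get((node, phone), [])
            ]
            if not frontier:
                break  # no path from this origin matches the keyword
        hits.extend((origin, end, acc) for end, acc in frontier)
    return hits

# Toy lattice with competing phone hypotheses between the same nodes.
edges = [
    (0, 1, "k", -1.0), (0, 1, "g", -1.5),
    (1, 2, "ae", -0.5), (1, 2, "eh", -0.75),
    (2, 3, "t", -0.25), (2, 3, "d", -1.0),
    (3, 4, "s", -0.5),
]

print(scan_lattice(edges, ["k", "ae", "t"]))  # → [(0, 3, -1.75)]
```

Because the lattice is built once, changing the keyword only changes the phone list passed to the scan; no keyword-specific model has to be trained or decoded. A practical system would presumably also need timing information and a score threshold to rank putative hits, which this sketch omits.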
Pages: 207-224
Page count: 18