Progress in transcription of broadcast news using Byblos

Cited by: 11
Authors
Nguyen, L [1]
Matsoukas, S [1]
Davenport, J [1]
Kubala, F [1]
Schwartz, R [1]
Makhoul, J [1]
Affiliation
[1] Verizon Commun, BBN Technol, Cambridge, MA 02138 USA
Keywords
speech recognition; broadcast news transcription; hidden Markov models; acoustic modeling; adaptation; search algorithms; single-tree fast-match; fast Gaussian computation; grammar spreading
DOI
10.1016/S0167-6393(02)00050-X
Chinese Library Classification number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
In this paper, we describe our progress during the last four years (1995-1999) in automatic transcription of broadcast news from radio and television using the BBN Byblos speech recognition system. Overall, we achieved steady progress as reflected through the results of the last four DARPA Hub-4 evaluations, with word error rates of 42.7%, 31.8%, 20.4% and 14.7% in 1995, 1996, 1997 and 1998, respectively. This progress can be attributed to improvements in acoustic modeling, channel and speaker adaptation, and search algorithms, as well as dealing with specific characteristics of the real-life variable speech found in broadcast news. Besides improving recognition accuracy, we also succeeded in developing several algorithms to achieve close-to-real-time recognition speed without a significant sacrifice in recognition accuracy. (C) 2002 Elsevier Science B.V. All rights reserved.
Pages: 213-230
Number of pages: 18