Message-driven speech recognition and topic-word extraction

被引:1
作者
Ohtsuki, K [1 ]
Furui, S [1 ]
Iwasaki, A [1 ]
Sakurai, N [1 ]
机构
[1] Tokyo Inst Technol, Meguro Ku, Tokyo 1528552, Japan
来源
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI | 1999年
关键词
D O I
10.1109/ICASSP.1999.759744
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes a new formulation for speech recognition/understanding systems, in which the a posteriori probability of a speaker's message that the speaker intend to address given an observed acoustic sequence is maximized. This is an extension of the current criterion that maximizes a probability of a word sequence. Among the various possible representations, we employ cooccurrence score of words measured by mutual information as the conditional probability of a word sequence occurring in a given message. The word sequence hypotheses obtained by bigram and trigram language models am rescored using the co-occurrence score. Experimental results show that the word accuracy is improved by this method. Topic-words, which represent the content of a speech signal are then extracted from speech recognition results based on the significance score of each word. When five topic-words are extracted for each broadcast-news article, 82.8%, of them are correct in average. This paper also proposes a verbalization-dependent language model, which is useful for Japanese dictation systems.
引用
收藏
页码:625 / 628
页数:4
相关论文
共 8 条
[1]  
Chen S., 1998, P ICASSP 98
[2]  
FURUI S, 1998, P BROADCAST NEWS TRA, P144
[3]   CO-OCCURRENCE AND TRANSFORMATION IN LINGUISTIC STRUCTURE [J].
HARRIS, ZS .
LANGUAGE, 1957, 33 (03) :283-340
[4]  
JUANG BH, 1996, IEEE WORKSH NEUR NET
[5]   A CACHE-BASED NATURAL-LANGUAGE MODEL FOR SPEECH RECOGNITION [J].
KUHN, R ;
DEMORI, R .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1990, 12 (06) :570-583
[6]  
MILLER S, 1994, P HUM LANG TECHN WOR, P278
[7]  
NOREAULT T, 1997, INFORMATION RETRIEVA, P57
[8]  
PIERACCINI R, 1992, P ICASSP 92