TOWARD THE ULTIMATE SYNTHESIS RECOGNITION SYSTEM

被引:4
作者
FURUI, S
机构
[1] Nippon Telegraph Tel. Hum. I., Musashino-shi, Tokyo 180
关键词
D O I
10.1073/pnas.92.22.10040
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper predicts speech synthesis, speech recognition, and speaker recognition technology for the year 2001, and it describes the most important research problems to be solved in order to arrive at these ultimate synthesis and recognition systems. The problems for speech synthesis include natural and intelligible voice production, prosody control based on meaning, capability of controlling synthesized voice quality and choosing individual speaking style, multilingual and multidialectal synthesis, choice of application-oriented speaking styles, capability of adding emotion, and synthesis from concepts, The problems for speech recognition include robust recognition against speech variations, adaptation/normalization to variations due to environmental conditions and speakers, automatic knowledge acquisition for acoustic and linguistic modeling, spontaneous speech recognition, naturalness and ease of human-machine interaction, and recognition of emotion, The problems for speaker recognition are similar to those for speech recognition, The research topics related to all these techniques include the use of articulatory and perceptual constraints and evaluation methods for measuring the quality of technology and systems.
引用
收藏
页码:10040 / 10045
页数:6
相关论文
共 32 条
[1]   SPEECH TECHNOLOGY IN 2001 - NEW RESEARCH DIRECTIONS [J].
ATAL, BS .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1995, 92 (22) :10046-10051
[2]  
ATAL BS, 1983, P IEEE INT C ACOUSTI
[3]  
BASSON S, 1992, P COST 232 WORKSHOP
[4]   MODELS OF NATURAL-LANGUAGE UNDERSTANDING [J].
BATES, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1995, 92 (22) :9977-9982
[5]  
CARLSON R, 1985, P NATL ACAD SCI USA, V92, P9932
[6]   THE ROLE OF VOICE INPUT FOR HUMAN-MACHINE COMMUNICATION [J].
COHEN, PR ;
OVIATT, SL .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1995, 92 (22) :9921-9927
[7]  
FLANAGANJL, 1991, P EUROSPEECH 91, P7
[8]   ON THE ROLE OF SPECTRAL TRANSITION FOR SPEECH-PERCEPTION [J].
FURUI, S .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1986, 80 (04) :1016-1025
[9]  
FURUI S, 1992, P ESCA WORKSH SPEECH, P31
[10]  
FURUI S, 1989, DIGITAL SPEECH PROCE