TOWARD THE ULTIMATE SYNTHESIS RECOGNITION SYSTEM

被引：4

作者：

FURUI, S

机构：

[1] Nippon Telegraph Tel. Hum. I., Musashino-shi, Tokyo 180

来源：

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA | 1995年 / 92卷 / 22期

关键词：

D O I：

10.1073/pnas.92.22.10040

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

This paper predicts speech synthesis, speech recognition, and speaker recognition technology for the year 2001, and it describes the most important research problems to be solved in order to arrive at these ultimate synthesis and recognition systems. The problems for speech synthesis include natural and intelligible voice production, prosody control based on meaning, capability of controlling synthesized voice quality and choosing individual speaking style, multilingual and multidialectal synthesis, choice of application-oriented speaking styles, capability of adding emotion, and synthesis from concepts, The problems for speech recognition include robust recognition against speech variations, adaptation/normalization to variations due to environmental conditions and speakers, automatic knowledge acquisition for acoustic and linguistic modeling, spontaneous speech recognition, naturalness and ease of human-machine interaction, and recognition of emotion, The problems for speaker recognition are similar to those for speech recognition, The research topics related to all these techniques include the use of articulatory and perceptual constraints and evaluation methods for measuring the quality of technology and systems.

引用

页码：10040 / 10045

页数：6

共 32 条

[1] SPEECH TECHNOLOGY IN 2001 - NEW RESEARCH DIRECTIONS [J].