ATR JAPANESE SPEECH DATABASE AS A TOOL OF SPEECH RECOGNITION AND SYNTHESIS

被引：216

作者：

KUREMATSU, A

TAKEDA, K

SAGISAKA, Y

KATAGIRI, S

KUWABARA, H

SHIKANO, K

机构：

[1] ATR Interpreting Telephony Research Laboratories, Souraku-gun, Kyoto, 619-02, Inuidani, Seika-cho

来源：

SPEECH COMMUNICATION | 1990年 / 9卷 / 04期

关键词：

D O I：

10.1016/0167-6393(90)90011-W

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

A large-scale Japanese speech database has been described. The database basically consists of (1) a word speech database, (2) a continuous speech database, (3) a database for a large number of speakers, and (4) a database for speech synthesis. Multiple transcriptions have been made in five different layers from simple phonemic descriptions to fine acoustic-phonetic transcriptions. The database has been used to develop algorithms in speech recognition and synthesis studies and to find acoustic, phonetic and linguistic evidence that will serve as basic data for speech technologies. © 1990.

引用

页码：357 / 363

页数：7