ATR JAPANESE SPEECH DATABASE AS A TOOL OF SPEECH RECOGNITION AND SYNTHESIS

被引:206
作者
KUREMATSU, A
TAKEDA, K
SAGISAKA, Y
KATAGIRI, S
KUWABARA, H
SHIKANO, K
机构
[1] ATR Interpreting Telephony Research Laboratories, Souraku-gun, Kyoto, 619-02, Inuidani, Seika-cho
关键词
D O I
10.1016/0167-6393(90)90011-W
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A large-scale Japanese speech database has been described. The database basically consists of (1) a word speech database, (2) a continuous speech database, (3) a database for a large number of speakers, and (4) a database for speech synthesis. Multiple transcriptions have been made in five different layers from simple phonemic descriptions to fine acoustic-phonetic transcriptions. The database has been used to develop algorithms in speech recognition and synthesis studies and to find acoustic, phonetic and linguistic evidence that will serve as basic data for speech technologies. © 1990.
引用
收藏
页码:357 / 363
页数:7
相关论文
共 15 条
  • [1] ABE M, 1988, P ICASSP, P655
  • [2] CARRE R, 1984, P ICASSP84
  • [3] Hatazaki K., 1989, P INT C AC SPEECH SI, P393
  • [4] ISO K, 1988, MAR P ACC SOC JAP, P89
  • [5] Itahashi S., 1985, Journal of the Acoustical Society of Japan, V41, P723
  • [6] KAWABATA T, 1989, P ICASSP89, P461
  • [7] KUWABARA H, 1989, P ICASSP89, P560
  • [8] PALLETT DS, 1987, PUBLIC DOMAIN SPEECH
  • [9] PERENNOU G, 1986, P ICASSP86, P325
  • [10] SAGISAKA Y, 1988, P ICASSP, P679