Prosodic focus control in reply speech generation for a spoken dialogue system of information retrieval

Cited by: 2
Authors
Kiriyama, S [1]
Hirose, K [1]
Minematsu, N [1]
Affiliations
[1] Shizuoka Univ, Fac Informat, Hamamatsu, Shizuoka 4328011, Japan
DOI
10.1109/WSS.2002.1224393
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
A spoken dialogue system for information retrieval on academic documents has been developed, with special attention to reply speech generation. To produce reply speech whose prosodic features are properly controlled to express dialogue focus, a scheme was developed to generate reply speech directly from the reply content. When the system was first developed, priority was placed on automatic processing, and prosodic focus was controlled by rather simple rules (the original rules). Based on a listening test of reply speech generated with the original rules, new rules were then developed. After a further listening test, these rules were revised (the revised rules). The validity of the revised rules was verified through an evaluation experiment. The experiment also indicated that users had preferences regarding the intonation of the reply speech.
Pages: 139-142
Number of pages: 4