Prosodic focus control in reply speech generation for a spoken dialogue system of information retrieval

被引：2

作者：

Kiriyama, S ^{[1
]}

Hirose, K ^{[1
]}

Minematsu, N ^{[1
]}

机构：

[1] Shizuoka Univ, Fac Informat, Hamamatsu, Shizuoka 4328011, Japan

来源：

PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON SPEECH SYNTHESIS | 2002年

关键词：

D O I：

10.1109/WSS.2002.1224393

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

A spoken dialogue system of information retrieval on academic documents has been developed with a special attention to reply speech generation. In order to realize speech reply with its prosodic features properly controlled to express dialogue focuses, a scheme was developed to directly generating speech reply from reply content. When developing the system firstly, a priority was placed on the automatic processing, and prosodic focus was controlled by rather simple rules (original rules). Based on the listening test for the reply speech generated using original rules, new rules were then developed. Through the further listening test, the rules were revised and called the revised rules. The validity of the revised rules was verified through an evaluation experiment. It was also indicated that there existed users' preferences on the intonation of the reply speech.

引用

页码：139 / 142

页数：4

共 50 条

[41] An information retrieval system for telephone dialogue in load dispatch center
Segawa, Osamu
Takeda, Kazuya
ELECTRICAL ENGINEERING IN JAPAN, 2008, 162 (03) : 44 - 50
[42] An information retrieval system for telephone dialogue in load dispatch center
Segawa, Osamu
Takeda, Kazuya
Electrical Engineering in Japan (English translation of Denki Gakkai Ronbunshi), 2008, 162 (03): : 44 - 50
[43] Applying an information-seeking dialogue model in an interactive information retrieval system
Yuan, Xiaojun
Belkin, Nicholas J.
JOURNAL OF DOCUMENTATION, 2014, 70 (05) : 829 - 855
[44] A system for information retrieval from large records of Czech spoken data
Nouza, Jan
Zd'ansky, Jindrich
Cerva, Petr
Kolorenc, Jan
TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2006, 4188 : 485 - 492
[45] Recognition of para-linguistic information and its application to spoken dialogue system
Fujie, S
Ejiri, Y
Matsusaka, Y
Kikuchi, H
Kobayashi, T
ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 231 - 236
[46] Direct speech-reply generation from text-dialogue context
Fujita, Kenichi
Ijima, Yusuke
Sugiyama, Hiroaki
PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1655 - 1660
[47] Control of Prosodic Focus in Corpus-based Generation of Fundamental Frequency based on the Generation Process Model
Ochi, Keiko
Hirose, Keikichi
Minematsu, Nobuaki
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1216 - 1216
[48] Web-enhanced Content Retrieval for Information Access Dialogue System
Lee, Donghyeon
Lee, Cheongjae
Jeong, Minwoo
Kim, Kyungduk
Kim, Seokhwan
Choi, Junhwi
Lee, Gary Geunbae
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1304 - +
[49] INCORPORATING SEMANTIC INFORMATION TO SELECTION OF WEB TEXTS FOR LANGUAGE MODEL OF SPOKEN DIALOGUE SYSTEM
Yoshino, Koichiro
Mori, Shinsuke
Kawahara, Tatsuya
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8252 - 8256
[50] Improved concept-to-speech generation in a dialogue system on road guidance
Yagi, Y
Takada, S
Hirose, K
Minematsu, N
2005 International Conference on Cyberworlds, Proceedings, 2005, : 429 - 436

← 1 2 3 4 5 →