Recognition of para-linguistic information and its application to spoken dialogue system

Cited by: 8
Authors
Fujie, S [1]
Ejiri, Y [1]
Matsusaka, Y [1]
Kikuchi, H [1]
Kobayashi, T [1]
Affiliations
[1] Waseda Univ, Sch Sci & Engn, Tokyo, Japan
DOI
10.1109/ASRU.2003.1318446
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Human-human interaction in spoken dialogue appears to rely not only on the linguistic information in utterances but also on additional information that supports it. We call this additional information "para-linguistic information". In this paper, we present a method for recognizing attitudes from prosodic information and a method for recognizing head gestures. The former recognizes two attitudes, "positive" and "negative", using the F0 pattern and phoneme alignment as features. The latter recognizes three gestures, "nod", "tilt" and "shake", using a left-to-right HMM as the probabilistic model and optical flow as features. Experimental results show that these methods are sufficient to recognize the user's attitude as para-linguistic information. Finally, we show a prototype spoken dialogue system that uses para-linguistic information and how this information contributes to efficient conversation.
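The gesture recognizer described in the abstract can be illustrated with a minimal sketch: one left-to-right HMM per gesture, scored with the forward algorithm, with the highest-likelihood model chosen as the result. Everything specific here is an assumption made for illustration and is not taken from the paper: the optical flow is assumed to be already quantized into the coarse symbols in `SYMBOLS`, the state layouts and probabilities are hand-set rather than trained, and the names `peaked`, `left_to_right_hmm`, `log_likelihood` and `classify` are hypothetical.

```python
import math

# Hypothetical quantization of per-frame optical flow into coarse motion symbols.
SYMBOLS = ["up", "down", "left", "right", "roll", "still"]


def peaked(symbol, p=0.8):
    """Emission distribution that favors one symbol and spreads the rest evenly."""
    rest = (1.0 - p) / (len(SYMBOLS) - 1)
    return {s: (p if s == symbol else rest) for s in SYMBOLS}


def left_to_right_hmm(emissions, stay=0.6):
    """Left-to-right HMM: each state either repeats or advances to the next state."""
    n = len(emissions)
    trans = [[0.0] * n for _ in range(n)]
    for i in range(n):
        if i + 1 < n:
            trans[i][i] = stay
            trans[i][i + 1] = 1.0 - stay
        else:
            trans[i][i] = 1.0  # the last state can only repeat
    return {"trans": trans, "emit": emissions}


def log_likelihood(model, observations):
    """Scaled forward algorithm; the chain is assumed to start in state 0."""
    if not observations:
        raise ValueError("observations must not be empty")
    trans, emit = model["trans"], model["emit"]
    n = len(trans)
    alpha = [0.0] * n
    alpha[0] = emit[0][observations[0]]
    total = 0.0
    for t, obs in enumerate(observations):
        if t > 0:
            alpha = [
                sum(alpha[i] * trans[i][j] for i in range(n)) * emit[j][obs]
                for j in range(n)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")
        alpha = [a / scale for a in alpha]  # rescale to avoid underflow
        total += math.log(scale)
    return total


# Hand-set illustrative models (assumed, not trained on any data).
MODELS = {
    "nod": left_to_right_hmm([peaked("down"), peaked("up"), peaked("still")]),
    "shake": left_to_right_hmm([peaked("left"), peaked("right"), peaked("still")]),
    "tilt": left_to_right_hmm([peaked("roll"), peaked("still"), peaked("roll")]),
}


def classify(observations):
    """Return the gesture whose HMM assigns the highest log-likelihood."""
    return max(MODELS, key=lambda name: log_likelihood(MODELS[name], observations))


if __name__ == "__main__":
    print(classify(["down", "down", "up", "up", "still"]))
```

In a real system the emission and transition parameters would be estimated from labeled gesture sequences, and the features would more likely be continuous flow vectors than discrete symbols; the sketch only shows the model-per-class scoring scheme.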
Pages: 231-236
Number of pages: 6
Related papers
50 in total
  • [1] Anger Recognition in Spoken Dialog Using Linguistic and Para-Linguistic Information
    Nomoto, Narichika
    Tamoto, Masafumi
    Masataki, Hirokazu
    Yoshioka, Osamu
    Takahashi, Satoshi
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1556 - 1559
  • [2] A conversation robot using head gesture recognition as para-linguistic information
    Fujie, S
    Ejiri, Y
    Nakajima, K
    Matsusaka, Y
    Kobayashi, T
    RO-MAN 2004: 13TH IEEE INTERNATIONAL WORKSHOP ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, PROCEEDINGS, 2004, : 159 - 164
  • [3] Recognition of positive/negative attitude and its application to a spoken dialogue system
    Fujie, Shinya
    Ejiri, Yasushi
    Kikuchi, Hideaki
    Kobayashi, Tetsunori
    Systems and Computers in Japan, 2006, 37 (12): : 45 - 55
  • [4] Para-linguistic information represented as distortion of the acoustic universal structure in speech
    Minematsu, Nobuaki
    Asakawa, Satoshi
    Hirose, Keikichi
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 261 - 264
  • [5] Straight-tempo: A universal tool to manipulate linguistic and para-linguistic specific information
    Kawahara, H
    SMC '97 CONFERENCE PROCEEDINGS - 1997 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: CONFERENCE THEME: COMPUTATIONAL CYBERNETICS AND SIMULATION, 1997, : 1620 - 1625
  • [6] Subject-independent Pain Recognition using Physiological Signals and Para-linguistic Vocalizations
    Shoukry, Nadeen
    Elkilany, Omar
    Thiam, Patrick
    Kessler, Viktor
    Schwenker, Friedhelm
    ICPRAM: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2020, : 142 - 150
  • [7] A Chinese spoken dialogue system for train information
    Chen, JY
    Wu, J
    Wang, ZY
    2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 1097 - 1103
  • [8] Recognition of Paralinguistic Information in Spoken Dialogue Systems for Elderly People
    Perez-Espinosa, Humberto
    Martinez-Miranda, Juan
    ADVANCES IN ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, MICAI 2015, PT I, 2015, 9413 : 107 - 117
  • [9] Improvement of the recognition rate of spoken queries to the dialogue system
    Matousek, V
    Ocelíková, J
    TEXT, SPEECH AND DIALOGUE, 1999, 1692 : 308 - 314
  • [10] Using Information State to Improve Dialogue Move Identification in a Spoken Dialogue System
    Ai, Hua
    Roque, Antonio
    Leuski, Anton
    Traum, David
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2596 - +