Embodied conversational agents in Wizard-of-Oz and multimodal interaction applications

被引:0
作者
Rojc, Matej [1 ]
Rotovnik, Tomaz [1 ]
Brus, Miso [2 ]
Jan, Dusan [2 ]
Kacic, Zdravko [1 ]
机构
[1] Univ Maribor, Fac Elect Engn & Comp Sci, Maribor, Slovenia
[2] Agito doo, Ljubljana, Slovenia
来源
VERBAL AND NONVERBAL COMMUNICATION BEHAVIOURS | 2007年 / 4775卷
关键词
conversational agents; speech recognition; text-to-speech synthesis; speech-to-speech translation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Embodied conversational agents employed in multimodal interaction applications have the potential to achieve similar properties as humans in face-to-face conversation. They enable the inclusion of verbal and nonverbal communication. Thus, the degree of personalization of the user interface is much higher than in other human-computer inter-faces. This, of course, greatly contributes to the naturalness and user friendliness of the interface, opening-up a wide area of possible applications. Two implementations of embodied conversational agents in human-computer interaction are presented in this paper: the first one in a Wizard-of-Oz application and the second in a dialogue system. In the Wizard-of-Oz application, the embodied conversational agent is applied in a way that it conveys the spoken information of the operator to the user with whom the operator communicates. Depending on the scenario of the application, the user may or not be aware of the operator's involvement. The operator can communicate with the user based on audio/visual, or only audio, communication. This paper describes an application setup, which enables distant communication with the user, where the user is unaware of the operator's involvement. A real-time viseme recognizer is needed to ensure a proper response from the agent. In addition, implementation of the embodied conversational agent Lili hosting an entertainment show, which is broadcast by RTV Slovenia, will be described in more detail. Employment of the embodied conversational agent as a virtual major-domo named Maja, within an intelligent ambience, using speech recognition system and TTS system PLATTOS, will be also described.
引用
收藏
页码:294 / +
页数:3
相关论文
共 30 条
  • [21] QuickSet: Multimodal interaction for distributed applications
    Cohen, PR
    Johnston, M
    McGee, D
    Oviatt, S
    Pittman, J
    Smith, I
    Chen, L
    Clow, J
    ACM MULTIMEDIA 97, PROCEEDINGS, 1997, : 31 - 40
  • [22] Far-field multimodal speech processing and conversational interaction in smart spaces
    Potamianos, Gerasimos
    Huang, Jing
    Marcheret, Etienne
    Libal, Vit
    Balchandran, Rajesh
    Epstein, Mark
    Seredi, Ladislav
    Labsk, Martin
    Ures, Lubos
    Black, Matthew
    Lucey, Patrick
    2008 HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS, 2008, : 120 - +
  • [23] Exploring Smart Agents for the Interaction with Multimodal Mediated Environments
    Richer, Robert
    Zhao, Nan
    Eskofier, Bjoern M.
    Paradiso, Joseph A.
    MULTIMODAL TECHNOLOGIES AND INTERACTION, 2020, 4 (02) : 1 - 18
  • [24] A Survey of Conversational Agents and Their Applications for Self-Management of Chronic Conditions
    Park, Min Sook
    Upama, Paramita Basak
    Anik, Adib Ahmed
    Ahamed, Sheikh Iqbal
    Luo, Jake
    Tian, Shiyu
    Rabbani, Masud
    Oh, Hyungkyoung
    2023 IEEE 47TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC, 2023, : 1064 - 1075
  • [25] The EducAgent Platform: Intelligent Conversational Agents for E-Learning Applications
    Griol, David
    Garcia-Herrero, Jesus
    Molina, Jose M.
    AMBIENT INTELLIGENCE: SOFTWARE AND APPLICATIONS, 2011, 92 : 117 - 124
  • [26] Rethinking Interaction with Conversational Agents: How to Create a Positive User Experience Utilizing Dialog Patterns
    Heuer, Marvin
    Lewandowski, Tom
    Weglewski, Joffrey
    Mayer, Tom
    Kubicek, Max
    Lembke, Patrick
    Ortgiese, Simon
    Boehmann, Tilo
    DESIGN, USER EXPERIENCE, AND USABILITY, DUXU 2023, PT IV, 2023, 14033 : 283 - 301
  • [27] The Chatbot Usability Scale: the Design and Pilot of a Usability Scale for Interaction with AI-Based Conversational Agents
    Borsci S.
    Malizia A.
    Schmettow M.
    van der Velde F.
    Tariverdiyeva G.
    Balaji D.
    Chamberlain A.
    Personal and Ubiquitous Computing, 2022, 26 (1) : 95 - 119
  • [28] Patient Engagement with Conversational Agents in Health Applications 2016-2022: A Systematic Review and Meta-Analysis
    Cevasco, Kevin E.
    Brown, Rachel E. Morrison
    Woldeselassie, Rediet
    Kaplan, Seth
    JOURNAL OF MEDICAL SYSTEMS, 2024, 48 (01)
  • [29] Deliberative and Paternalistic Interaction Styles for Conversational Agents in Digital Health: Procedure and Validation Through a Web-Based Experiment
    Schachner, Theresa
    Gross, Christoph
    Hasl, Andrea
    Wangenheim, Florian, V
    Kowatsch, Tobias
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (01)
  • [30] A contemporary review on chatbots, AI-powered virtual conversational agents, ChatGPT: Applications, open challenges and future research directions
    Casheekar, Avyay
    Lahiri, Archit
    Rath, Kanishk
    Prabhakar, Kaushik Sanjay
    Srinivasan, Kathiravan
    COMPUTER SCIENCE REVIEW, 2024, 52