A real-time Speech-interfaced System for Group Conversation Modeling

Cited by: 1
Authors
Rocchi, Cesare [1]
Principi, Emanuele [1]
Cifani, Simone [1]
Rotili, Rudy [1]
Squartini, Stefano [1]
Piazza, Francesco [1]
Affiliation
[1] Univ Politecn Marche, DIBET, 3MediaLabs, I-60131 Ancona, Italy
Source
NEURAL NETS WIRN09 | 2009 / Vol. 204
Keywords
Conversation modeling; Tabletop; Keyword spotting
DOI
10.3233/978-1-60750-072-8-70
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, a speech-interfaced system for fostering group conversations is presented. The system captures conversation keywords and shows visual stimuli on a tabletop display. A stimulus can be feedback on the current conversation or a cue to discuss new topics. This work describes the overall system architecture and highlights its design choices, with a particular focus on real-time implementation issues. A suitable speech enhancement front-end and a keyword spotter have been integrated on a common software platform for real-time audio processing, namely Nu-Tech, resulting in a helpful and flexible architecture for real-world applications in group conversation modeling scenarios. These system characteristics, together with experimental results obtained from simulations on recorded speech data, seem to confirm the efficacy of the approach, motivating the development of further features and experimentation in new scenarios.
Pages: 70-80
Page count: 11