A real-time Speech-interfaced System for Group Conversation Modeling

被引：1

作者：

Rocchi, Cesare ^{[1
]}

Principi, Emanuele ^{[1
]}

Cifani, Simone ^{[1
]}

Rotili, Rudy ^{[1
]}

Squartini, Stefano ^{[1
]}

Piazza, Francesco ^{[1
]}

机构：

[1] Univ Politecn Marche, DIBET, 3MediaLabs, I-60131 Ancona, Italy

来源：

NEURAL NETS WIRN09 | 2009年 / 204卷

关键词：

Conversation modeling; Tabletop; keyword spotting;

D O I：

10.3233/978-1-60750-072-8-70

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, a speech-interfaced system for fostering group conversations is presented. The system captures conversation keywords and shows visual stimuli on a tabletop display. A stimulus can be a feedback to the current conversation or a cue to discuss new topics. This work describes the overall system architecture and highlights details about the design choices of the overall system, with a particular focus on the real-time implementation issues. A suitable speech enhancement front-end and a keyword spotter have been integrated on a common software platform for real-time audio processing, namely Nu-Tech, resulting in a helpful and flexible architecture for real-world applications in group conversation modeling scenarios. Such system characteristics, jointly with some experimental results obtained from simulations on recorded speech data, seem to confirm the efficacy of the approach motivating the development of further features and the experimentation in new scenarios.

引用

页码：70 / 80

页数：11

共 50 条

[41] Real-time interfaces for speech and singing
Hunt, A
Howard, D
Worsdall, J
PROCEEDINGS OF THE 26TH EUROMICRO CONFERENCE, VOLS I AND II, 2000, : A356 - A361
[42] A Hybrid Temporal Modeling Phoneme Recognized Network for Real-time Speech Animation
Yu, Zixiao
Wang, Haohong
Ren, Jian
THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2020), 2020, : 55 - 60
[43] A Real-Time Text to Audio-Visual Speech Synthesis System
Wang, Lijuan
Qian, Xiaojun
Ma, Lei
Qian, Yao
Chen, Yining
Soong, Frank
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2338 - +
[44] Integration of Speech and Text Processing Modules into a Real-Time Dialogue System
Ptacek, Jan
Ircing, Pavel
Spousta, Miroslav
Romportl, Jan
Loose, Zdenek
Cinkova, Silvie
Relano Gil, Jose
Santos, Raul
TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 552 - +
[45] LOW LATENCY PARAMETER GENERATION FOR REAL-TIME SPEECH SYNTHESIS SYSTEM
Na, Xingyu
Xie, Xiang
Kuang, Jingming
2014 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2014,
[46] Real-time air flow measurement system for use by speech therapists
McCurrach, G., 1600, (29):
[47] REAL-TIME SPEECH ENHANCEMENT SYSTEM USING ENVELOPE EXPANSION TECHNIQUE
CLARKSON, PM
BAHGAT, S
ELECTRONICS LETTERS, 1989, 25 (17) : 1186 - 1188
[48] Real-Time Observation of Single Atoms Trapped and Interfaced to a Nanofiber Cavity
Nayak, Kali P.
Wang, Jie
Keloth, Jameesh
PHYSICAL REVIEW LETTERS, 2019, 123 (21)
[49] Real-time PCR Machine System Modeling and a Systematic Approach for the Robust Design of a Real-time PCR-on-a-Chip System
Lee, Da-Sheng
SENSORS, 2010, 10 (01) : 697 - 718
[50] MODELING IN REAL-TIME SYSTEMS
BOASSON, M
COMPUTER STANDARDS & INTERFACES, 1987, 6 (01) : 107 - 114

← 1 2 3 4 5 →