A real-time Speech-interfaced System for Group Conversation Modeling

被引:1
|
作者
Rocchi, Cesare [1 ]
Principi, Emanuele [1 ]
Cifani, Simone [1 ]
Rotili, Rudy [1 ]
Squartini, Stefano [1 ]
Piazza, Francesco [1 ]
机构
[1] Univ Politecn Marche, DIBET, 3MediaLabs, I-60131 Ancona, Italy
来源
NEURAL NETS WIRN09 | 2009年 / 204卷
关键词
Conversation modeling; Tabletop; keyword spotting;
D O I
10.3233/978-1-60750-072-8-70
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a speech-interfaced system for fostering group conversations is presented. The system captures conversation keywords and shows visual stimuli on a tabletop display. A stimulus can be a feedback to the current conversation or a cue to discuss new topics. This work describes the overall system architecture and highlights details about the design choices of the overall system, with a particular focus on the real-time implementation issues. A suitable speech enhancement front-end and a keyword spotter have been integrated on a common software platform for real-time audio processing, namely Nu-Tech, resulting in a helpful and flexible architecture for real-world applications in group conversation modeling scenarios. Such system characteristics, jointly with some experimental results obtained from simulations on recorded speech data, seem to confirm the efficacy of the approach motivating the development of further features and the experimentation in new scenarios.
引用
收藏
页码:70 / 80
页数:11
相关论文
共 50 条
  • [1] REAL-TIME SPEECH SYNTHESIS SYSTEM
    AINSWORTH, WA
    IEEE TRANSACTIONS ON AUDIO AND ELECTROACOUSTICS, 1972, AU20 (05): : 397 - +
  • [2] A real-time speech quality improvement system
    Zhao, HA
    ETFA 2003: IEEE CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION, VOL 1, PROCEEDINGS, 2003, : 491 - 495
  • [3] A Real-Time Scene Text to Speech System
    Neumann, Lukas
    Matas, Jiri
    COMPUTER VISION - ECCV 2012, PT III, 2012, 7585 : 619 - 622
  • [4] Real-time speech synthesis system driven by visual speech
    Li, G
    Xie, GM
    Lin, L
    PROCEEDINGS OF THE THIRD INTERNATIONAL SYMPOSIUM ON INSTRUMENTATION SCIENCE AND TECHNOLOGY, VOL 2, 2004, : 397 - 402
  • [5] Modeling a complex real-time system
    Happonen, Ari
    Porras, Jari
    4TH INTERNATIONAL INDUSTRIAL SIMULATION CONFERENCE 2006, 2006, : 175 - +
  • [6] A REAL-TIME SPEECH DIALOG SYSTEM USING SPONTANEOUS SPEECH UNDERSTANDING
    TAKEBAYASHI, Y
    TSUBOI, H
    KANAZAWA, H
    SADAMOTO, Y
    HASHIMOTO, H
    SHINCHI, H
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1993, E76D (01) : 112 - 120
  • [7] Designing and Implementing a Real-Time Speech Summarizer System
    Cheng, Ding-Yuan
    Chen, Chi-Hua
    Wu, Yu-Rou
    Lo, Chi-Chun
    Lin, Hui-Fei
    2014 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C 2014), 2014, : 725 - 728
  • [8] Design and Evaluation of a Real-Time Speech Recognition System
    Shruthi, S.
    Yashaswi, G.
    Shruti, V
    Manikandan, J.
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 425 - 430
  • [9] VOXCOM - A SYSTEM FOR ANALYZING NATURAL SPEECH IN REAL-TIME
    ALPERT, M
    MEREWETHER, F
    HOMEL, P
    MARTZ, J
    LOMASK, M
    BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS, 1986, 18 (02): : 267 - 272
  • [10] Speech Recognition System for Embedded Real-time Applications
    Cheng, Octavian
    Abdulla, Waleed
    Salcic, Zoran
    2009 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2009), 2009, : 118 - 122