Multimedia corpus of in-car speech communication

被引:9
|
作者
Kawaguchi, N [1 ]
Takeda, K [1 ]
Itakura, F [1 ]
机构
[1] Nagoya Univ, Ctr Integrated Acoust Informat Res, Chikusa Ku, Nagoya, Aichi 4648603, Japan
来源
JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY | 2004年 / 36卷 / 2-3期
关键词
Global Position System; Engine Speed; Automatic Speech Recognition; Speech Corpus; Differential Global Position System;
D O I
10.1023/B:VLSI.0000015094.60008.dc
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An ongoing project for constructing a multimedia corpus of dialogues under the driving condition is reported. More than 500 subjects have been enrolled in this corpus development and more than 2 gigabytes of signals have been collected during approximately 60 minutes of driving per subject. Twelve microphones and three video cameras are installed in a car to obtain audio and video data. In addition, five signals regarding car control and the location of the car provided by the Global Positioning System (GPS) are recorded. All signals are simultaneously recorded directly onto the hard disk of the PCs onboard the specially designed data collection vehicle (DCV). The in-car dialogues are initiated by a human operator, an automatic speech recognition (ASR) system and a wizard of OZ (WOZ) system so as to collect as many speech disfluencies as possible. In addition to the details of data collection, in this paper, preliminary results on intermedia signal conversion are described as an example of the corpus-based in-car speech signal processing research.
引用
收藏
页码:153 / 159
页数:7
相关论文
共 50 条
  • [1] Multimedia Corpus of In-Car Speech Communication
    Nobuo Kawaguchi
    Kazuya Takeda
    Fumitada Itakura
    Journal of VLSI signal processing systems for signal, image and video technology, 2004, 36 : 153 - 159
  • [2] THE AUSTRALIAN ENGLISH SPEECH CORPUS FOR IN-CAR SPEECH PROCESSING
    Kleinschmidt, Tristan
    Mason, Michael
    Wong, Eddie
    Sridharan, Sridha
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4177 - 4180
  • [3] Construction and evaluation of a large in-car speech corpus
    Takeda, K
    Fujimura, H
    Itou, K
    Kawaguchi, N
    Matsubara, S
    Itakura, F
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (03): : 553 - 561
  • [4] Hybrid in-car speech recognition for mobile multimedia applications
    Kuhn, T
    Jameel, A
    Stümpfle, M
    Haddadi, A
    1999 IEEE 49TH VEHICULAR TECHNOLOGY CONFERENCE, VOLS 1-3: MOVING INTO A NEW MILLENIUM, 1999, : 2009 - 2013
  • [5] Hybrid in-car speech recognition for mobile multimedia applications
    DaimlerChrysler Aerospace, Ulm, Germany
    IEEE Veh Technol Conf, (2009-2013):
  • [6] CIAIR in-car speech corpus - Influence of driving status
    Kawaguchi, N
    Matsubara, S
    Takeda, K
    Itakura, F
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (03): : 578 - 582
  • [7] Analysis of a large in-car speech corpus and its application to the multimodel ASR
    Fujimua, H
    Miyajima, C
    Itou, K
    Takeda, K
    Itakura, F
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 445 - 448
  • [8] Cultural Analyses of In-car Communication
    Carbaugh, Donal
    Winter, Ute
    van Over, Brion
    Molina-Markham, Elizabeth
    Lie, Sunny
    JOURNAL OF APPLIED COMMUNICATION RESEARCH, 2013, 41 (02) : 195 - 201
  • [9] Dual-Microphone Speech Reinforcement System With Howling-Control for In-Car Speech Communication
    Alkaher, Yehav
    Cohen, Israel
    FRONTIERS IN SIGNAL PROCESSING, 2022, 2
  • [10] Wireless in-car communication with Bluetooth
    Fügen, Thomas
    von Hagen, Jürgen
    Wiesbeck, Werner
    ATZ worldwide, 2002, 104 (7-8) : 25 - 28