On the Influence of Automatic Segmentation and Clustering in Automatic Speech Recognition

被引:0
|
作者
Lopez-Otero, Paula [1 ]
Docio-Fernandez, Laura [1 ]
Garcia-Mateo, Carmen [1 ]
Cardenal-Lopez, Antonio [1 ]
机构
[1] Univ Vigo, AtlantTIC Res Ctr, Multimedia Technol Grp GTM, EE Telecomun, Vigo 36310, Spain
来源
ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES | 2012年 / 328卷
关键词
automatic segmentation; automatic speech recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An automatic speech recognition (ASR) system needs a previous segmentation stage that differentiates between speech and non-speech. Other information such as "who spoke when" can be proportioned to the ASR system, allowing it to perform speaker adaptation. This paper studies the influence of automatic speech segmentation and speaker clustering on ASR performance, in order to detect the weak points of the diarization system by analyzing what causes the different types of recognition errors: insertions, suppressions and substitutions. Experiments are run on the Galician broadcast news database Transcrigal, and results show that the speaker diarization system presented in this work is suitable as a previous step to ASR, as the performance is almost the same as the obtained when using manual segmentation and clustering.
引用
收藏
页码:49 / 58
页数:10
相关论文
共 50 条
  • [1] Automatic Speech Segmentation Based on Acoustical Clustering
    Gomez, Jon A.
    Sanchis, Emilio
    Castro-Bleda, Maria J.
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2010, 6218 : 540 - 548
  • [2] Automatic phrase segmentation and clustering in spontaneous speech
    Beke, Andras
    Szaszak, Gyorgy
    Varadi, Viola
    2013 IEEE 4TH INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2013, : 459 - 462
  • [3] Automatic Speech Segmentation for Automatic Speech Translation
    Klosowski, Piotr
    Dustor, Adam
    COMPUTER NETWORKS, CN 2013, 2013, 370 : 466 - 475
  • [4] Automatic speech segmentation in syllable centric speech recognition system
    Panda S.P.
    Nayak A.K.
    International Journal of Speech Technology, 2016, 19 (1) : 9 - 18
  • [5] Research on automatic speaker recognition based on speech clustering
    Xu, Limin
    Qian, Bo
    Cheng, Weiming
    Tang, Zhenmin
    ICICIC 2006: FIRST INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING, INFORMATION AND CONTROL, VOL 2, PROCEEDINGS, 2006, : 105 - +
  • [6] Game Theoretic Approach for Automatic Speech Segmentation and Recognition
    Rekha, J. Ujwala
    Chatrapati, K. Shahu
    Babu, A. Vinaya
    2014 IEEE 28TH CONVENTION OF ELECTRICAL & ELECTRONICS ENGINEERS IN ISRAEL (IEEEI), 2014,
  • [7] Automatic Acoustic Segmentation for Speech Recognition on Broadcast Recordings
    Peng, Gang
    Hwang, Mei-Yuh
    Ostendorf, Mari
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2580 - 2583
  • [8] Automatic speech segmentation for an open vocabulary recognition system
    Ban, L
    Tatai, P
    SIGNAL ANALYSIS & PREDICTION I, 1997, : 303 - 306
  • [9] AUTOMATIC SEGMENTATION OF SPEECH
    VANHEMERT, JP
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (04) : 1008 - 1012
  • [10] Automatic sentence segmentation of speech for automatic summarization
    Mrozinski, Joanna
    Whittaker, Edward W. D.
    Chatain, Pierre
    Furui, Sadaoki
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 981 - 984