On the Influence of Automatic Segmentation and Clustering in Automatic Speech Recognition

被引：0

作者：

Lopez-Otero, Paula ^{[1
]}

Docio-Fernandez, Laura ^{[1
]}

Garcia-Mateo, Carmen ^{[1
]}

Cardenal-Lopez, Antonio ^{[1
]}

机构：

[1] Univ Vigo, AtlantTIC Res Ctr, Multimedia Technol Grp GTM, EE Telecomun, Vigo 36310, Spain

来源：

ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES | 2012年 / 328卷

关键词：

automatic segmentation; automatic speech recognition;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

An automatic speech recognition (ASR) system needs a previous segmentation stage that differentiates between speech and non-speech. Other information such as "who spoke when" can be proportioned to the ASR system, allowing it to perform speaker adaptation. This paper studies the influence of automatic speech segmentation and speaker clustering on ASR performance, in order to detect the weak points of the diarization system by analyzing what causes the different types of recognition errors: insertions, suppressions and substitutions. Experiments are run on the Galician broadcast news database Transcrigal, and results show that the speaker diarization system presented in this work is suitable as a previous step to ASR, as the performance is almost the same as the obtained when using manual segmentation and clustering.

引用

页码：49 / 58

页数：10

共 50 条

[1] Automatic Speech Segmentation Based on Acoustical Clustering
Gomez, Jon A.
Sanchis, Emilio
Castro-Bleda, Maria J.
STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2010, 6218 : 540 - 548
[2] Automatic phrase segmentation and clustering in spontaneous speech
Beke, Andras
Szaszak, Gyorgy
Varadi, Viola
2013 IEEE 4TH INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2013, : 459 - 462
[3] Automatic Speech Segmentation for Automatic Speech Translation
Klosowski, Piotr
Dustor, Adam
COMPUTER NETWORKS, CN 2013, 2013, 370 : 466 - 475
[4] Automatic speech segmentation in syllable centric speech recognition system
Panda S.P.
Nayak A.K.
International Journal of Speech Technology, 2016, 19 (1) : 9 - 18
[5] Research on automatic speaker recognition based on speech clustering
Xu, Limin
Qian, Bo
Cheng, Weiming
Tang, Zhenmin
ICICIC 2006: FIRST INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING, INFORMATION AND CONTROL, VOL 2, PROCEEDINGS, 2006, : 105 - +
[6] Game Theoretic Approach for Automatic Speech Segmentation and Recognition
Rekha, J. Ujwala
Chatrapati, K. Shahu
Babu, A. Vinaya
2014 IEEE 28TH CONVENTION OF ELECTRICAL & ELECTRONICS ENGINEERS IN ISRAEL (IEEEI), 2014,
[7] Automatic Acoustic Segmentation for Speech Recognition on Broadcast Recordings
Peng, Gang
Hwang, Mei-Yuh
Ostendorf, Mari
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2580 - 2583
[8] Automatic speech segmentation for an open vocabulary recognition system
Ban, L
Tatai, P
SIGNAL ANALYSIS & PREDICTION I, 1997, : 303 - 306
[9] AUTOMATIC SEGMENTATION OF SPEECH
VANHEMERT, JP
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (04) : 1008 - 1012
[10] Automatic sentence segmentation of speech for automatic summarization
Mrozinski, Joanna
Whittaker, Edward W. D.
Chatain, Pierre
Furui, Sadaoki
2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 981 - 984

← 1 2 3 4 5 →