Hard C-means clustering for voice activity detection

被引:34
作者
Gorriz, J. M. [1 ]
Ramirez, J.
Lang, E. W.
Puntonet, C. G.
机构
[1] Univ Granada, Dept Signal Theory Networking & Commun, E-18071 Granada, Spain
[2] Univ Regensburg, Inst Biophys, D-93040 Regensburg, Germany
[3] Univ Granada, Dept Comp Architecture & Technol, E-18071 Granada, Spain
关键词
voice activity detection; speech recognition; clustering; C-means; prototypes; subband energy;
D O I
10.1016/j.specom.2006.07.006
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
An effective voice activity detection (VAD) algorithm is proposed for improving speech recognition performance in noisy environments. The proposed speech/pause discrimination method is based on a hard-decision clustering approach built on a set of subband log-energies and noise prototypes that define a cluster. Detecting the presence of speech (a new cluster) is achieved using a basic sequential algorithm scheme (BSAS) according to a given "distance" (in this case, geometrical distance) and a suitable threshold. The accuracy of the Cluster VAD (CIVAD) algorithm lies in the use of a decision function defined over a multiple-observation (MO) window of averaged subband log-energies and a suitable noise subspace model defined in terms of prototypes. In addition, the reduced computational cost of the clustering approach makes it adequate for real-time applications, i.e. speech recognition. An exhaustive analysis is conducted on the Spanish SpeechDat-Car databases in order to assess the performance of the proposed method and to compare it to existing standard VAD methods. The results show improvements in detection accuracy over standard VADs such as ITU-T G.729, ETSI GSM AMR and ETSI AFE and a representative set of recently reported VAD algorithms for noise robust speech processing. (C) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:1638 / 1649
页数:12
相关论文
共 50 条
[41]   On Objective-Based Rough c-Means Clustering [J].
Endo, Yasunori ;
Kinoshita, Naohiko .
2012 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING (GRC 2012), 2012,
[42]   Generalized Fuzzy C-Means with Spatial Information for Clustering of Remote Sensing Images [J].
Aydav, Prem Shankar Singh ;
Minz, Sonajharia .
2014 INTERNATIONAL CONFERENCE ON DATA MINING AND INTELLIGENT COMPUTING (ICDMIC), 2014,
[43]   Research and improvement of C-means clustering algorithm based on Image segmentation application [J].
Wang, Chunying ;
Zhang, Jiahui ;
Yang, Qi .
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (06) :10325-10335
[44]   Application of Fuzzy and Possibilistic c-Means Clustering Models in Blind Speaker Clustering [J].
Gosztolya, Gabor ;
Szilagyi, Laszlo .
ACTA POLYTECHNICA HUNGARICA, 2015, 12 (07) :41-56
[45]   Genetically derived Fuzzy c-means clustering algorithm for segmentation [J].
Kachouie, NN ;
Alirezaie, J ;
Raahemifar, K .
CCECE 2003: CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-3, PROCEEDINGS: TOWARD A CARING AND HUMANE TECHNOLOGY, 2003, :1119-1122
[46]   POSSIBILISTIC FUZZY C-MEANS CLUSTERING ON MEDICAL DIAGNOSTIC SYSTEMS [J].
Simhachalam, B. ;
Ganesan, G. .
2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, :1125-1129
[47]   A Fully-Unsupervised Possibilistic C-Means Clustering Algorithm [J].
Yang, Miin-Shen ;
Chang-Chien, Shou-Jen ;
Nataliani, Yessica .
IEEE ACCESS, 2018, 6 :78308-78320
[48]   Cluster Forests Based Fuzzy C-Means for Data Clustering [J].
Ben Ayed, Abdelkarim ;
Ben Halima, Mohamed ;
Alimi, Adel M. .
INTERNATIONAL JOINT CONFERENCE SOCO'16- CISIS'16-ICEUTE'16, 2017, 527 :564-573
[49]   Automatic Text Summarization using Fuzzy C-Means Clustering [J].
Anam, Shakil Ashraful ;
Rahman, A. M. Muntasir ;
Saleheen, Nasif Noor ;
Arif, Hossain .
2018 JOINT 7TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV) AND 2018 2ND INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR), 2018, :180-184
[50]   Geometrically guided Fuzzy C-Means clustering of multispectral images [J].
Noordam, JC ;
van der Broek, WHAM ;
Buydens, LMC .
MULTISPECTRAL AND HYPERSPECTRAL IMAGE ACQUISITION AND PROCESSING, 2001, 4548 :161-166