Hard C-means clustering for voice activity detection

被引:34
作者
Gorriz, J. M. [1 ]
Ramirez, J.
Lang, E. W.
Puntonet, C. G.
机构
[1] Univ Granada, Dept Signal Theory Networking & Commun, E-18071 Granada, Spain
[2] Univ Regensburg, Inst Biophys, D-93040 Regensburg, Germany
[3] Univ Granada, Dept Comp Architecture & Technol, E-18071 Granada, Spain
关键词
voice activity detection; speech recognition; clustering; C-means; prototypes; subband energy;
D O I
10.1016/j.specom.2006.07.006
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
An effective voice activity detection (VAD) algorithm is proposed for improving speech recognition performance in noisy environments. The proposed speech/pause discrimination method is based on a hard-decision clustering approach built on a set of subband log-energies and noise prototypes that define a cluster. Detecting the presence of speech (a new cluster) is achieved using a basic sequential algorithm scheme (BSAS) according to a given "distance" (in this case, geometrical distance) and a suitable threshold. The accuracy of the Cluster VAD (CIVAD) algorithm lies in the use of a decision function defined over a multiple-observation (MO) window of averaged subband log-energies and a suitable noise subspace model defined in terms of prototypes. In addition, the reduced computational cost of the clustering approach makes it adequate for real-time applications, i.e. speech recognition. An exhaustive analysis is conducted on the Spanish SpeechDat-Car databases in order to assess the performance of the proposed method and to compare it to existing standard VAD methods. The results show improvements in detection accuracy over standard VADs such as ITU-T G.729, ETSI GSM AMR and ETSI AFE and a representative set of recently reported VAD algorithms for noise robust speech processing. (C) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:1638 / 1649
页数:12
相关论文
共 50 条
  • [31] Kernel methods for clustering:: Competitive learning and c-means
    Inokuchi, Ryo
    Miyamoto, Sadaaki
    [J]. INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2006, 14 (04) : 481 - 493
  • [32] Effective fuzzy c-means clustering algorithms for data clustering problems
    Kannan, S. R.
    Ramathilagam, S.
    Chung, P. C.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (07) : 6292 - 6300
  • [33] Sparse learning based fuzzy c-means clustering
    Gu, Jing
    Jiao, Licheng
    Yang, Shuyuan
    Zhao, Jiaqi
    [J]. KNOWLEDGE-BASED SYSTEMS, 2017, 119 : 113 - 125
  • [34] Density-Weighted Fuzzy c-Means Clustering
    Hathaway, Richard J.
    Hu, Yingkang
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2009, 17 (01) : 243 - 252
  • [35] Analytically tractable case of fuzzy c-means clustering
    Pianykh, OS
    [J]. PATTERN RECOGNITION, 2006, 39 (01) : 35 - 46
  • [36] Gaussian-kernel c-means clustering algorithms
    Shou-Jen Chang-Chien
    Yessica Nataliani
    Miin-Shen Yang
    [J]. Soft Computing, 2021, 25 : 1699 - 1716
  • [37] Fuzzy C-Means Clustering and Sonification of HRV Features
    Borthakur, Debanjan
    Grace, Victoria
    Batchelor, Paul
    Dubey, Harishchandra
    [J]. 2019 4TH IEEE/ACM INTERNATIONAL CONFERENCE ON CONNECTED HEALTH: APPLICATIONS, SYSTEMS AND ENGINEERING TECHNOLOGIES (CHASE), 2019, : 53 - 57
  • [38] Sparse Fuzzy C-Means Clustering with Lasso Penalty
    Parveen, Shazia
    Yang, Miin-Shen
    [J]. SYMMETRY-BASEL, 2024, 16 (09):
  • [39] Projected fuzzy C-means clustering with locality preservation
    Zhou, Jie
    Pedrycz, Witold
    Yue, Xiaodong
    Gao, Can
    Lai, Zhihui
    Wan, Jun
    [J]. PATTERN RECOGNITION, 2021, 113
  • [40] High speed c-means clustering in reconfigurable hardware
    Hwang, Wen-Jyi
    Hsu, Chih-Chieh
    Li, Hui-Ya
    Weng, Sheng-Kai
    Yu, Tsung-Yi
    [J]. MICROPROCESSORS AND MICROSYSTEMS, 2010, 34 (06) : 237 - 246