Voice Activity Detection in Presence of Transient Noise Using Spectral Clustering

被引:38
|
作者
Mousazadeh, Saman [1 ]
Cohen, Israel [1 ]
机构
[1] Technion Israel Inst Technol, Dept Elect Engn, IL-32000 Haifa, Israel
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2013年 / 21卷 / 06期
基金
以色列科学基金会;
关键词
Gaussian mixture model; spectral clustering; transient noise; voice activity detection; ACOUSTIC EVENT DETECTION;
D O I
10.1109/TASL.2013.2248717
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Voice activity detection has attracted significant research efforts in the last two decades. Despite much progress in designing voice activity detectors, voice activity detection (VAD) in presence of transient noise is a challenging problem. In this paper, we develop a novel VAD algorithm based on spectral clustering methods. We propose a VAD technique which is a supervised learning algorithm. This algorithm divides the input signal into two separate clusters (i.e., speech presence and speech absence frames). We use labeled data in order to adjust the parameters of the kernel used in spectral clustering methods for computing the similarity matrix. The parameters obtained in the training stage together with the eigenvectors of the normalized Laplacian of the similarity matrix and Gaussianmixture model (GMM) are utilized to compute the likelihood ratio needed for voice activity detection. Simulation results demonstrate the advantage of the proposed method compared to conventional statistical model-based VAD algorithms in presence of transient noise.
引用
收藏
页码:1261 / 1271
页数:11
相关论文
共 50 条
  • [31] CLUSTERING AND SUPPRESSION OF TRANSIENT NOISE IN SPEECH SIGNALS USING DIFFUSION MAPS
    Talmon, Ronen
    Cohen, Israel
    Gannot, Sharon
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5084 - 5087
  • [32] Corner detection of contour images using spectral clustering
    Li, Xi
    Hu, Weiming
    Zhang, Zhongfei
    2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7, 2007, : 1165 - +
  • [33] Two-Microphone Voice Activity Detection in the Presence of Coherent Interference
    Kim, Gibak
    Cho, Nam Ik
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1686 - 1689
  • [34] Noise robust voice activity detection based on switching Kalman filter
    Fujimoto, Masakiyo
    Ishizuka, Kentaro
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (03): : 467 - 477
  • [35] Robust Voice Activity Detection Feature Design Based on Spectral Kurtosis
    Zhang Shuyin
    Guo Ying
    Zhang Qun
    PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND COMPUTER SCIENCE, VOL III, 2009, : 269 - 272
  • [36] Noise Robust Voice Activity Detection Based on Switching Kalman Filter
    Fujimoto, Masakiyo
    Ishizuka, Kentaro
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 965 - 968
  • [37] Complete-Linkage Clustering for Voice Activity Detection in Audio and Visual Speech
    Ghaemmaghami, Houman
    Dean, David
    Kalantari, Shahram
    Sridharan, Sridha
    Fookes, Clinton
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2292 - 2296
  • [38] Robust Voice Activity Detection Using Feature Combination
    Haghani, Sahar Khaksar
    Ahadi, Seyed Mohammad
    2013 21ST IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2013,
  • [39] Crack Detection using Spectral Clustering Based on Crack Features
    Matsuoka, Takumi
    Matsushima, Kousuke
    PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 2575 - 2578
  • [40] SAR Image Change Detection Using Watershed and Spectral Clustering
    Niu, Ruican
    Jiao, L. C.
    Wang, Guiting
    Feng, Jie
    MIPPR 2011: REMOTE SENSING IMAGE PROCESSING, GEOGRAPHIC INFORMATION SYSTEMS, AND OTHER APPLICATIONS, 2011, 8006