Voice Activity Detection in Presence of Transient Noise Using Spectral Clustering

被引:38
|
作者
Mousazadeh, Saman [1 ]
Cohen, Israel [1 ]
机构
[1] Technion Israel Inst Technol, Dept Elect Engn, IL-32000 Haifa, Israel
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2013年 / 21卷 / 06期
基金
以色列科学基金会;
关键词
Gaussian mixture model; spectral clustering; transient noise; voice activity detection; ACOUSTIC EVENT DETECTION;
D O I
10.1109/TASL.2013.2248717
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Voice activity detection has attracted significant research efforts in the last two decades. Despite much progress in designing voice activity detectors, voice activity detection (VAD) in presence of transient noise is a challenging problem. In this paper, we develop a novel VAD algorithm based on spectral clustering methods. We propose a VAD technique which is a supervised learning algorithm. This algorithm divides the input signal into two separate clusters (i.e., speech presence and speech absence frames). We use labeled data in order to adjust the parameters of the kernel used in spectral clustering methods for computing the similarity matrix. The parameters obtained in the training stage together with the eigenvectors of the normalized Laplacian of the similarity matrix and Gaussianmixture model (GMM) are utilized to compute the likelihood ratio needed for voice activity detection. Simulation results demonstrate the advantage of the proposed method compared to conventional statistical model-based VAD algorithms in presence of transient noise.
引用
收藏
页码:1261 / 1271
页数:11
相关论文
共 50 条
  • [11] A NEW APPROACH FOR ROBUST REALTIME VOICE ACTIVITY DETECTION USING SPECTRAL PATTERN
    Moattar, M. H.
    Homayounpour, M. M.
    Kalantari, Nima Khademi
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4478 - 4481
  • [12] ON USING SPECTRAL GRADIENT IN CONDITIONAL MAP CRITERION FOR ROBUST VOICE ACTIVITY DETECTION
    Choi, Jae-Hun
    Chang, Joon-Hyuk
    PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC 2012), 2012, : 370 - 374
  • [13] Robust voice activity detection directed by noise classification
    Saeedi, Jamal
    Ahadi, Seyed Mohammad
    Faez, Karim
    SIGNAL IMAGE AND VIDEO PROCESSING, 2015, 9 (03) : 561 - 572
  • [14] Voice activity detection in non-stationary noise
    Li Ye
    Wang Tong
    Cui Huijuan
    Tang Kun
    2006 IMACS: MULTICONFERENCE ON COMPUTATIONAL ENGINEERING IN SYSTEMS APPLICATIONS, VOLS 1 AND 2, 2006, : 1573 - +
  • [15] Influence of Noise and Voice Activity Detection on Speaker Verification
    Dustor, Adam
    COMPUTER NETWORKS, CN 2016, 2016, 608 : 207 - 215
  • [16] Robust voice activity detection directed by noise classification
    Jamal Saeedi
    Seyed Mohammad Ahadi
    Karim Faez
    Signal, Image and Video Processing, 2015, 9 : 561 - 572
  • [17] A Robust Voice Activity Detection Algorithm in Nonstationary Noise
    Lei, Jianjun
    Yang, Jiachen
    Wang, Jian
    Yang, Zhen
    2009 INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS, PROCEEDINGS, 2009, : 195 - +
  • [18] Hard C-means clustering for voice activity detection
    Gorriz, J. M.
    Ramirez, J.
    Lang, E. W.
    Puntonet, C. G.
    SPEECH COMMUNICATION, 2006, 48 (12) : 1638 - 1649
  • [19] A robust voice activity detection based on noise eigenspace projection
    Ying, Dongwen
    Shi, Yu
    Soong, Frank
    Dang, Jianwu
    Lu, Xugang
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 76 - +
  • [20] DySANA: Dynamic Speech and Noise Adaptation for Voice Activity Detection
    Weiss, Ron J.
    Kristjansson, Trausti
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 127 - +