Voice Activity Detection in Presence of Transient Noise Using Spectral Clustering

被引:38
|
作者
Mousazadeh, Saman [1 ]
Cohen, Israel [1 ]
机构
[1] Technion Israel Inst Technol, Dept Elect Engn, IL-32000 Haifa, Israel
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2013年 / 21卷 / 06期
基金
以色列科学基金会;
关键词
Gaussian mixture model; spectral clustering; transient noise; voice activity detection; ACOUSTIC EVENT DETECTION;
D O I
10.1109/TASL.2013.2248717
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Voice activity detection has attracted significant research efforts in the last two decades. Despite much progress in designing voice activity detectors, voice activity detection (VAD) in presence of transient noise is a challenging problem. In this paper, we develop a novel VAD algorithm based on spectral clustering methods. We propose a VAD technique which is a supervised learning algorithm. This algorithm divides the input signal into two separate clusters (i.e., speech presence and speech absence frames). We use labeled data in order to adjust the parameters of the kernel used in spectral clustering methods for computing the similarity matrix. The parameters obtained in the training stage together with the eigenvectors of the normalized Laplacian of the similarity matrix and Gaussianmixture model (GMM) are utilized to compute the likelihood ratio needed for voice activity detection. Simulation results demonstrate the advantage of the proposed method compared to conventional statistical model-based VAD algorithms in presence of transient noise.
引用
收藏
页码:1261 / 1271
页数:11
相关论文
共 50 条
  • [1] VOICE ACTIVITY DETECTION IN TRANSIENT NOISE ENVIRONMENT USING LAPLACIAN PYRAMID ALGORITHM
    Spingarn, Nurit
    Mousazadeh, Saman
    Cohen, Israel
    2014 14TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2014, : 238 - 242
  • [2] Voice activity detection in the presence of transient based on graph
    Guo, Xiao-Yuan
    Gao, Chun-Xian
    Liu, Hui
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2023, 2023 (01)
  • [3] Voice activity detection in the presence of transient based on graph
    Xiao-Yuan Guo
    Chun-Xian Gao
    Hui Liu
    EURASIP Journal on Audio, Speech, and Music Processing, 2023
  • [4] Voice Activity Detection In Presence Of Transients Using The Scattering Transform
    Dov, David
    Cohen, Israel
    2014 IEEE 28TH CONVENTION OF ELECTRICAL & ELECTRONICS ENGINEERS IN ISRAEL (IEEEI), 2014,
  • [5] Robust voice activity detection based on noise eigenspace
    Ying, Dongwen
    Shi, Yu
    Lu, Xugang
    Dang, Jianwu
    Soong, Frank
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2007, 28 (06) : 413 - 423
  • [6] Kernel Method for Voice Activity Detection in the Presence of Transients
    Dov, David
    Talmon, Ronen
    Cohen, Israel
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (12) : 2313 - 2326
  • [7] On Noise Robust Voice Activity Detection
    Dekens, Tomas
    Verhelst, Werner
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2660 - 2663
  • [8] Voice activity detection in nonstationary noise
    Tanyer, SG
    Özer, H
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (04): : 478 - 482
  • [9] Speaker and Noise Independent Voice Activity Detection
    Germain, Francois G.
    Sun, Dennis L.
    Mysore, Gautham J.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 732 - 736
  • [10] A NEW APPROACH FOR ROBUST REALTIME VOICE ACTIVITY DETECTION USING SPECTRAL PATTERN
    Moattar, M. H.
    Homayounpour, M. M.
    Kalantari, Nima Khademi
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4478 - 4481