Voice Activity Detection in Presence of Transient Noise Using Spectral Clustering

被引：38

作者：

Mousazadeh, Saman ^{[1
]}

Cohen, Israel ^{[1
]}

机构：

[1] Technion Israel Inst Technol, Dept Elect Engn, IL-32000 Haifa, Israel

来源：

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2013年 / 21卷 / 06期

基金：

以色列科学基金会;

关键词：

Gaussian mixture model; spectral clustering; transient noise; voice activity detection; ACOUSTIC EVENT DETECTION;

D O I：

10.1109/TASL.2013.2248717

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Voice activity detection has attracted significant research efforts in the last two decades. Despite much progress in designing voice activity detectors, voice activity detection (VAD) in presence of transient noise is a challenging problem. In this paper, we develop a novel VAD algorithm based on spectral clustering methods. We propose a VAD technique which is a supervised learning algorithm. This algorithm divides the input signal into two separate clusters (i.e., speech presence and speech absence frames). We use labeled data in order to adjust the parameters of the kernel used in spectral clustering methods for computing the similarity matrix. The parameters obtained in the training stage together with the eigenvectors of the normalized Laplacian of the similarity matrix and Gaussianmixture model (GMM) are utilized to compute the likelihood ratio needed for voice activity detection. Simulation results demonstrate the advantage of the proposed method compared to conventional statistical model-based VAD algorithms in presence of transient noise.

引用

页码：1261 / 1271

页数：11

共 50 条

[21] On training targets for noise-robust voice activity detection
Braun, Sebastian
Tashev, Ivan
29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 421 - 425
[22] Efficient voice activity detection algorithm using long-term spectral flatness measure
Yanna Ma
Akinori Nishihara
EURASIP Journal on Audio, Speech, and Music Processing, 2013
[23] Voice Activity Detection Via Noise Reducing Using Non-Negative Sparse Coding
Teng, Peng
Jia, Yunde
IEEE SIGNAL PROCESSING LETTERS, 2013, 20 (05) : 475 - 478
[24] A Statistical Model-Based Voice Activity Detection Using Multiple DNNs and Noise Awareness
Hwang, Inyoung
Sim, Jaeseong
Kim, Sang-Hyeon
Song, Kwang-Sub
Chang, Joon-Hyuk
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2277 - 2281
[25] Voice Activity Detection for Transient Noisy Environment Based on Diffusion Nets
Ivry, Amir
Berdugo, Baruch
Cohen, Israel
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (02) : 254 - 264
[26] VOICE ACTIVITY DETECTION USING SUBBAND NONCIRCULARITY
Wisdom, Scott
Okopal, Greg
Atlas, Les
Pitton, James
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4505 - 4509
[27] Voice activity detection using neural network
Ikedo, J
IEICE TRANSACTIONS ON COMMUNICATIONS, 1998, E81B (12) : 2509 - 2513
[28] A Novel Voice Activity Detection for Multi-Channel Noise Reduction
Colak, Ramazan
Akdeniz, Rafet
IEEE ACCESS, 2021, 9 (09): : 91017 - 91026
[29] DSP-based voice activity detection and background noise reduction
Singh C.
Venter M.
Muthu R.K.
Brown D.
International Journal of Speech Technology, 2018, 21 (04) : 851 - 859
[30] VOICE ACTIVITY DETECTION USING A PERIODICITY MEASURE
TUCKER, R
IEE PROCEEDINGS-I COMMUNICATIONS SPEECH AND VISION, 1992, 139 (04): : 377 - 380

← 1 2 3 4 5 →