DNN-Based Speech Enhancement Using Soft Audible Noise Masking for Wind Noise Reduction

被引:7
作者
Bai, Haichuan [1 ,2 ]
Ge, Fengpei [1 ,2 ]
Yan, Yonghong [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Inst Acoust, Key Lab Speech Acoust & Content Understanding, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Xinjiang Lab Minor Speech & Language Informat Pro, Urumqi 830011, Peoples R China
基金
中国国家自然科学基金;
关键词
wind noise reduction; speech enhancement; soft audible noise masking; psychoacoustic model; deep neural network; ALGORITHM;
D O I
10.1109/CC.2018.8456465
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
This paper presents a deep neural network (DNN)-based speech enhancement algorithm based on the soft audible noise masking for the single-channel wind noise reduction. To reduce the low-frequency residual noise, the psychoacoustic model is adopted to calculate the masking threshold from the estimated clean speech spectrum. The gain for noise suppression is obtained based on soft audible noise masking by comparing the estimated wind noise spectrum with the masking threshold. To deal with the abruptly time-varying noisy signals, two separate DNN models are utilized to estimate the spectra of clean speech and wind noise components. Experimental results on the subjective and objective quality tests show that the proposed algorithm achieves the better performance compared with the conventional DNN-based wind noise reduction method.
引用
收藏
页码:235 / 243
页数:9
相关论文
共 24 条
  • [1] [Anonymous], 2006, COMMUNICATION TECHNO
  • [2] [Anonymous], 1992, 111723 ISOIEC JTC1SC
  • [3] [Anonymous], 2007, WID EXT REC P 862
  • [4] Noise estimation by minima controlled recursive averaging for robust speech enhancement
    Cohen, I
    Berdugo, B
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2002, 9 (01) : 12 - 15
  • [5] Fisher W., 1986, PROC DARPA WORKSHOP, P93
  • [6] Franco Patino S, 2010, PRACTICAS OFICIO, V6, P1
  • [7] Gerkmann T, 2011, 2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), P145, DOI 10.1109/ASPAA.2011.6082266
  • [8] DISTANCE MEASURES FOR SPEECH PROCESSING
    GRAY, AH
    MARKEL, JD
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1976, 24 (05): : 380 - 391
  • [9] A fast learning algorithm for deep belief nets
    Hinton, Geoffrey E.
    Osindero, Simon
    Teh, Yee-Whye
    [J]. NEURAL COMPUTATION, 2006, 18 (07) : 1527 - 1554
  • [10] Ioffe S., P 32 INT C MACH LEAR