Deep Noise Tracking Network: A Hybrid Signal Processing/Deep Learning Approach to Speech Enhancement

被引:0
|
作者
Nie, Shuai [1 ,3 ]
Liang, Shan [1 ]
Liu, Bin [1 ,3 ]
Zhang, Yaping [1 ,3 ]
Liu, Wenju [1 ]
Tao, Jianhua [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China
[2] CAS Ctr Excellence Brain Sci & Intelligence Techn, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
来源
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES | 2018年
基金
国家重点研发计划;
关键词
speech enhancement; noise tracking; deep learning; signal processing; RECOGNITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Noise statistics and speech spectrum characteristics are the essential information for the single channel speech enhancement. The signal processing-based methods mainly rely on noise statistics estimation. They perform very well for stationary noise, but have remained difficult to cope with non-stationary noise. While the deep leaming-based methods mainly focus on the perception on the spectrum characteristics of speech and have a capacity in dealing with non-stationary noise. However, the performance would degrade dramatically for the unseen noise types, which could be due to the over-reliance on data and the ignorance to domain knowledge of signal process. Obviously, the hybrid signal processing/deep learning scheme may be a smart alternative. In this paper, we incorporate the powerful perceptual capabilities of deep learning in the conventional speech enhancement framework. Deep learning is used to estimate the speech presence probability and the update factor of noise statistics, which are then integrated into the Wiener filter-based speech enhancement structure to enhance the desired speech. All components are jointly optimized by a spectrum approximation objective. Systematic experiments on CHiME-4 and NOISEX-92 demonstrate the proposed hybrid signal processing/deep learning approach to noise suppression in noise-unmatched and noise-matched conditions.
引用
收藏
页码:3219 / 3223
页数:5
相关论文
共 50 条
  • [1] The Application of Deep Neural Network in Speech Enhancement Processing
    Chen Jian-ming
    Liang Zhi-cheng
    2018 5TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE 2018), 2018, : 1263 - 1266
  • [2] A Hybrid Approach for Single Channel Speech Enhancement using Deep Neural Network and Harmonic Regeneration Noise Reduction
    Jamal, Norezmi
    Fuad, N.
    Sha'abani, M. N. A. H.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (10) : 243 - 248
  • [3] Speech enhancement based on noise classification and deep neural network
    Wang, Wenbo
    Liu, Houguang
    Yang, Jianhua
    Cao, Guohua
    Hua, Chunli
    MODERN PHYSICS LETTERS B, 2019, 33 (17):
  • [4] iDeepMMSE: An Improved Deep Learning Approach to MMSE Speech and Noise Power Spectrum Estimation for Speech Enhancement
    Kim, Minseung
    Song, Hyungchan
    Cheong, Sein
    Shin, Jong Won
    INTERSPEECH 2022, 2022, : 181 - 185
  • [5] Deep neural network and noise classification-based speech enhancement
    Shi, Wenhua
    Zhang, Xiongwei
    Zou, Xia
    Han, Wei
    MODERN PHYSICS LETTERS B, 2017, 31 (19-21):
  • [6] A Deep Neural Network Based Harmonic Noise Model for Speech Enhancement
    Ouyang, Zhiheng
    Yu, Hongjiang
    Zhu, Wei-Ping
    Champagne, Benoit
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3224 - 3228
  • [7] Speech enhancement with noise estimation and filtration using deep learning models
    Kantamaneni, Sravanthi
    Charles, A.
    Babu, T. Ranga
    THEORETICAL COMPUTER SCIENCE, 2023, 941 : 14 - 28
  • [8] Speech enhancement with noise estimation and filtration using deep learning models
    Kantamaneni, Sravanthi
    Charles, A.
    Babu, T. Ranga
    THEORETICAL COMPUTER SCIENCE, 2023, 941 : 14 - 28
  • [9] Speech signal enhancement based on deep learning in distributed acoustic sensing
    Shang, Ying
    Yang, Jian
    Chen, Wang
    Yi, Jichao
    Sun, Maocheng
    Du, Yuankai
    Huang, Sheng
    Zhao, Wenan
    Qu, Shuai
    Wang, Weitao
    Lv, Lei
    Liu, Shuai
    Zhao, Yanjie
    Ni, Jiasheng
    OPTICS EXPRESS, 2023, 31 (03) : 4067 - 4079
  • [10] A Hybrid Deep-Learning Approach for Single Channel HF-SSB Speech Enhancement
    Chen, Yantao
    Dong, Binhong
    Zhang, Xiaoxue
    Gao, Pengyu
    Li, Shaoqian
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2021, 10 (10) : 2165 - 2169