Deep Residual Network-Based Augmented Kalman Filter for Speech Enhancement

被引:0
|
作者
Roy, Sujan Kumar [1 ]
Paliwal, Kuldip K. [1 ]
机构
[1] Griffith Univ, Signal Proc Lab, Sch Engn & Built Environm, Brisbane, Qld 4111, Australia
来源
2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC) | 2020年
关键词
Speech enhancement; augmented Kalman filter; residual network; LPC; whitening filter; NOISE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Speech enhancement using augmented Kalman filter (AKF) suffers from the inaccurate estimates of the key parameters, linear prediction coefficients (LPCs) of speech and noise signal in noisy conditions. The existing AKF particularly enhances speech in colored noise conditions. In this paper, a deep residual network (ResNet)-based method utilizes the LPC estimates of the AKF for speech enhancement in various noise conditions. Specifically, a ResNet20 (constructed with 20 layers) gives an estimate of the noise waveform for each noisy speech frame to compute the noise LPC parameters. Each noisy speech frame is pre-whitened by a whitening filter, which is constructed with the corresponding noise LPCs. The speech LPC parameters are computed from the pre-whitened speech. The improved speech and noise LPC parameters enable the AKF to minimize residual noise as well as distortion in the enhanced speech. Objective and subjective testing on NOIZEUS corpus reveal that the proposed method exhibits higher quality and intelligibility in the enhanced speech than some benchmark methods in various noise conditions for a wide range of SNR levels.
引用
收藏
页码:667 / 673
页数:7
相关论文
共 50 条
  • [1] A Deep Neural Network Based Kalman Filter for Time Domain Speech Enhancement
    Yu, Hongjiang
    Ouyang, Zhiheng
    Zhu, Wei-Ping
    Champagne, Benoit
    Ji, Yunyun
    2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,
  • [2] Low-dimensional recurrent neural network-based Kalman filter for speech enhancement
    Xia, Youshen
    Wang, Jun
    NEURAL NETWORKS, 2015, 67 : 131 - 139
  • [3] Deep Learning with Augmented Kalman Filter for Single-Channel Speech Enhancement
    Roy, Sujan Kumar
    Nicolson, Aaron
    Paliwal, Kuldip K.
    2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
  • [4] A Deep Learning-based Kalman Filter for Speech Enhancement
    Roy, Sujan Kumar
    Nicolson, Aaron
    Paliwal, Kuldip K.
    INTERSPEECH 2020, 2020, : 2692 - 2696
  • [5] DeepLPC: A Deep Learning Approach to Augmented Kalman Filter-Based Single-Channel Speech Enhancement
    Roy, Sujan Kumar
    Nicolson, Aaron
    Paliwal, Kuldip K.
    IEEE ACCESS, 2021, 9 : 64524 - 64538
  • [6] Subjective intelligibility of deep neural network-based speech enhancement
    Gelderblom, Femke B.
    Tronstad, Tron V.
    Viggen, Erlend Magnus
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1968 - 1972
  • [7] Gammatone Filter Bank-Deep Neural Network-based Monaural speech enhancement for unseen conditions
    Sivapatham, Shoba
    Kar, Asutosh
    Christensen, Mads Graesboll
    APPLIED ACOUSTICS, 2022, 194
  • [8] DeepResGRU: Residual gated recurrent neural network-augmented Kalman filtering for speech enhancement and recognition
    Saleem, Nasir
    Gao, Jiechao
    Khattak, Muhammad Irfan
    Rauf, Hafiz Tayyab
    Kadry, Seifedine
    Shafi, Muhammad
    KNOWLEDGE-BASED SYSTEMS, 2022, 238
  • [9] SHO based Deep Residual network and hierarchical speech features for speech enhancement
    Bhosle M.R.
    Narayaswamy N.K.
    International Journal of Speech Technology, 2023, 26 (02) : 355 - 370
  • [10] On supervised LPC estimation training targets for augmented Kalman filter-based speech enhancement
    Roy, Sujan Kumar
    Nicolson, Aaron
    Paliwal, Kuldip K.
    SPEECH COMMUNICATION, 2022, 142 : 49 - 60