Deep Residual Network-Based Augmented Kalman Filter for Speech Enhancement

Cited by: 0

Authors:

Roy, Sujan Kumar [1]

Paliwal, Kuldip K. [1]

Affiliation:

[1] Griffith Univ, Signal Proc Lab, Sch Engn & Built Environm, Brisbane, Qld 4111, Australia

Source:

2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC) | 2020

Keywords:

Speech enhancement; augmented Kalman filter; residual network; LPC; whitening filter; NOISE;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation and Computer Technology];

Discipline classification code:

0812;

Abstract:

Speech enhancement using the augmented Kalman filter (AKF) suffers from inaccurate estimates of its key parameters, the linear prediction coefficients (LPCs) of the speech and noise signals, in noisy conditions. The existing AKF enhances speech particularly in colored noise conditions. In this paper, a deep residual network (ResNet)-based method utilizes the LPC estimates of the AKF for speech enhancement in various noise conditions. Specifically, a ResNet20 (constructed with 20 layers) gives an estimate of the noise waveform for each noisy speech frame, from which the noise LPC parameters are computed. Each noisy speech frame is pre-whitened by a whitening filter constructed from the corresponding noise LPCs. The speech LPC parameters are then computed from the pre-whitened speech. The improved speech and noise LPC parameters enable the AKF to minimize residual noise as well as distortion in the enhanced speech. Objective and subjective testing on the NOIZEUS corpus reveals that the proposed method yields higher quality and intelligibility in the enhanced speech than some benchmark methods in various noise conditions over a wide range of SNR levels.
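
The LPC estimation chain described in the abstract (noise LPCs from an estimated noise waveform, pre-whitening of the noisy frame, speech LPCs from the pre-whitened frame) can be sketched as follows. This is only an illustrative sketch, not the authors' implementation: the function names lpc, whiten and estimate_lpcs, the autocorrelation-method (Levinson-Durbin) solver, and the default orders 10 and 4 are all assumptions made for the example, and the noise estimate is passed in as a plain array rather than produced by a ResNet20.

```python
import numpy as np

def lpc(frame, order):
    """Autocorrelation-method LPC solved with the Levinson-Durbin recursion.

    Returns (a, err): a[0] == 1 and a[1:] are the coefficients of the
    prediction-error filter A(z) = 1 + sum_k a[k] z^-k; err is the
    prediction-error power. (Hypothetical helper, not from the paper.)
    """
    x = np.asarray(frame, dtype=float)
    n = len(x)
    # Biased autocorrelation sums for lags 0..order
    r = np.array([np.dot(x[:n - k], x[k:]) for k in range(order + 1)])
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        if err <= 1e-12:          # silent or perfectly predictable frame
            break
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err            # reflection coefficient
        a_prev = a.copy()
        for j in range(1, i):
            a[j] = a_prev[j] + k * a_prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a, err

def whiten(frame, a_noise):
    """Apply the FIR whitening filter A_noise(z) to a frame (zero initial state)."""
    frame = np.asarray(frame, dtype=float)
    return np.convolve(frame, a_noise)[:len(frame)]

def estimate_lpcs(noisy_frame, noise_estimate, speech_order=10, noise_order=4):
    """Noise LPCs from the noise estimate, speech LPCs from the pre-whitened frame.

    noise_estimate stands in for the network output; the orders are
    illustrative assumptions only.
    """
    a_noise, _ = lpc(noise_estimate, noise_order)
    pre_whitened = whiten(noisy_frame, a_noise)
    a_speech, _ = lpc(pre_whitened, speech_order)
    return a_speech, a_noise
```

The coefficients returned here would then parameterize the speech and noise models of the AKF; the Kalman recursion itself is outside the scope of this sketch.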


Pages: 667-673

Page count: 7
