Hybrid AHS: A Hybrid of Kalman Filter and Deep Learning for Acoustic Howling Suppression

被引:0
作者
Zhang, Hao [1 ]
Yu, Meng [1 ]
Wu, Yuzhong [2 ]
Yu, Tao [2 ]
Yu, Dong [1 ]
机构
[1] Tencent AI Lab, Bellevue, WA 98004 USA
[2] Tencent Ethereal Audio Lab, Shenzhen, Guangdong, Peoples R China
来源
INTERSPEECH 2023 | 2023年
关键词
acoustic howling suppression; Kalman filter; teacher forcing; Deep AHS; hybrid method; CANCELLATION;
D O I
10.21437/Interspeech.2023-984
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Deep learning has been recently introduced for efficient acoustic howling suppression (AHS). However, the recurrent nature of howling creates a mismatch between offline training and streaming inference, limiting the quality of enhanced speech. To address this limitation, we propose a hybrid method that combines a Kalman filter with a self-attentive recurrent neural network (SARNN) to leverage their respective advantages for robust AHS. During offline training, a pre-processed signal obtained from the Kalman filter and an ideal microphone signal generated via teacher-forced training strategy are used to train the deep neural network (DNN). During streaming inference, the DNN's parameters are fixed while its output serves as a reference signal for updating the Kalman filter. Evaluation in both offline and streaming inference scenarios using simulated and real-recorded data shows that the proposed method efficiently suppresses howling and consistently outperforms baselines.
引用
收藏
页码:834 / 838
页数:5
相关论文
共 28 条
[1]  
Albu F, 2018, IEEE ICC, P45, DOI 10.1109/ICComm.2018.8430141
[2]   IMAGE METHOD FOR EFFICIENTLY SIMULATING SMALL-ROOM ACOUSTICS [J].
ALLEN, JB ;
BERKLEY, DA .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (04) :943-950
[3]  
Berdahl Edgar, 2010, P 13 INT C DIG AUD E, V610
[4]   Nonlinear loudspeaker compensation for hands free acoustic echo cancellation [J].
Birkett, AN ;
Goubran, RA .
ELECTRONICS LETTERS, 1996, 32 (12) :1063-1064
[5]   A NEURAL NETWORK-BASED HOWLING DETECTION METHOD FOR REAL-TIME COMMUNICATION APPLICATIONS [J].
Chen, Zhipeng ;
Hao, Yiya ;
Chen, Yaobin ;
Chen, Gong ;
Ruan, Liang .
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, :206-210
[6]  
Du JY, 2018, Arxiv, DOI arXiv:1808.10583
[7]   Frequency-domain adaptive Kalman filter for acoustic echo control in hands-free telephones [J].
Enzner, G ;
Vary, P .
SIGNAL PROCESSING, 2006, 86 (06) :1140-1156
[8]   Howling Noise Cancellation in Time-Frequency Domain by Deep Neural Networks [J].
Gan, Huaguo ;
Luo, Gaoyong ;
Luo, Yaqing ;
Luo, Wenbin .
PROCEEDINGS OF SIXTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICICT 2021), VOL 2, 2022, 236 :319-332
[9]  
Gil-Cacho Pepe, 2009, 2009 17th European Signal Processing Conference (EUSIPCO 2009), P2574
[10]  
Goyal A, 2016, ADV NEUR IN, V29