Hybrid AHS: A Hybrid of Kalman Filter and Deep Learning for Acoustic Howling Suppression

被引：0

作者：

Zhang, Hao ^{[1
]}

Yu, Meng ^{[1
]}

Wu, Yuzhong ^{[2
]}

Yu, Tao ^{[2
]}

Yu, Dong ^{[1
]}

机构：

[1] Tencent AI Lab, Bellevue, WA 98004 USA

[2] Tencent Ethereal Audio Lab, Shenzhen, Guangdong, Peoples R China

来源：

INTERSPEECH 2023 | 2023年

关键词：

acoustic howling suppression; Kalman filter; teacher forcing; Deep AHS; hybrid method; CANCELLATION;

D O I：

10.21437/Interspeech.2023-984

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Deep learning has been recently introduced for efficient acoustic howling suppression (AHS). However, the recurrent nature of howling creates a mismatch between offline training and streaming inference, limiting the quality of enhanced speech. To address this limitation, we propose a hybrid method that combines a Kalman filter with a self-attentive recurrent neural network (SARNN) to leverage their respective advantages for robust AHS. During offline training, a pre-processed signal obtained from the Kalman filter and an ideal microphone signal generated via teacher-forced training strategy are used to train the deep neural network (DNN). During streaming inference, the DNN's parameters are fixed while its output serves as a reference signal for updating the Kalman filter. Evaluation in both offline and streaming inference scenarios using simulated and real-recorded data shows that the proposed method efficiently suppresses howling and consistently outperforms baselines.

引用

页码：834 / 838

页数：5

共 28 条

[1]

Albu F, 2018, IEEE ICC, P45, DOI 10.1109/ICComm.2018.8430141

[2] IMAGE METHOD FOR EFFICIENTLY SIMULATING SMALL-ROOM ACOUSTICS [J].

ALLEN, JB ;

BERKLEY, DA .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (04) :943-950

[3]

Berdahl Edgar, 2010, P 13 INT C DIG AUD E, V610

[4] Nonlinear loudspeaker compensation for hands free acoustic echo cancellation [J].

Birkett, AN ;

Goubran, RA .

ELECTRONICS LETTERS, 1996, 32 (12) :1063-1064

[5] A NEURAL NETWORK-BASED HOWLING DETECTION METHOD FOR REAL-TIME COMMUNICATION APPLICATIONS [J].

Chen, Zhipeng ;

Hao, Yiya ;

Chen, Yaobin ;

Chen, Gong ;

Ruan, Liang .

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, :206-210

[6]

Du JY, 2018, Arxiv, DOI arXiv:1808.10583

[7] Frequency-domain adaptive Kalman filter for acoustic echo control in hands-free telephones [J].

Enzner, G ;

Vary, P .

SIGNAL PROCESSING, 2006, 86 (06) :1140-1156

[8] Howling Noise Cancellation in Time-Frequency Domain by Deep Neural Networks [J].

Gan, Huaguo ;

Luo, Gaoyong ;

Luo, Yaqing ;

Luo, Wenbin .

PROCEEDINGS OF SIXTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICICT 2021), VOL 2, 2022, 236 :319-332

[9]

Gil-Cacho Pepe, 2009, 2009 17th European Signal Processing Conference (EUSIPCO 2009), P2574

[10]

Goyal A, 2016, ADV NEUR IN, V29

← 1 2 3 →