Double Branches and Stages Neural Network for Joint Acoustic Echo and Noise Suppression

被引：0

作者：

Zhu, Taohua ^{[1
]}

Qian, Guobing ^{[1
]}

机构：

[1] Southwest Univ, Coll Elect & Informat Engn, Chongqing 400715, Peoples R China

来源：

IEEE SIGNAL PROCESSING LETTERS | 2024年 / 31卷

关键词：

Acoustic echo cancellation; complex domain; deep learning; double branches and stages; single-channel; SPEECH; CANCELLATION;

D O I：

10.1109/LSP.2024.3441492

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this letter, we propose a collaborative neural network framework for acoustic echo cancellation (AEC) tasks, where echo paths and spatial information are efficiently modeled through a two-stage subnetwork cascade to jointly repair the complex spectrum of the target speech. Specifically, we design a two-path structure consisting of a real part and an imaginary part as well as an amplitude phase, a lightweight network module to suppress part of the echo and potential noise in the first stage, and a larger network to estimate the complex residuals for phase correction and spectral restoration in the second stage. In order to maximize the performance of the model at each stage, we propose two convolutional modules: an inplace gate convolutional module and a complex squeezed temporal convolutional module (CSTCM). In addition, a cross-domain loss function is designed to improve the generalization capability. Experiments are conducted under various mismatch scenarios, and the results show that the proposed dual-path staging method provides superior performance over other advanced methods.

引用

页码：2150 / 2154

页数：5

共 36 条

[1]

Chen Z., ICASSP 2023 2023 IEE, P1

[2] A Two-stage Progressive Neural Network for Acoustic Echo Cancellation [J].

Chen, Zhuangqi ;

Xia, Xianjun ;

Chen, Cheng ;

Wang, Xianke ;

Leng, Yanhong ;

Chen, Li ;

Togneri, Roberto ;

Xiao, Yijian ;

Ding, Piao ;

Song, Shenyi ;

Zhang, Pingjian .

INTERSPEECH 2023, 2023, :795-799

[3] ICASSP 2022 ACOUSTIC ECHO CANCELLATION CHALLENGE [J].

Cutler, Ross ;

Saabas, Ando ;

Parnamaa, Tanel ;

Purin, Marju ;

Gamper, Hannes ;

Braun, Sebastian ;

Sorensen, Karsten ;

Aichner, Robert .

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, :9107-9111

[4] Learning With Learned Loss Function: Speech Enhancement With Quality-Net to Improve Perceptual Evaluation of Speech Quality [J].

Fu, Szu-Wei ;

Liao, Chien-Feng ;

Tsao, Yu .

IEEE SIGNAL PROCESSING LETTERS, 2020, 27 :26-30

[5]

Ge J., 2022, arXiv

[6] Context-Aware Poly(A) Signal Prediction Model via Deep Spatial-Temporal Neural Networks [J].

Guo, Yanbu ;

Zhou, Dongming ;

Li, Pu ;

Li, Chaoyang ;

Cao, Jinde .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (06) :8241-8253

[7] DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement [J].

Hu, Yanxin ;

Liu, Yun ;

Lv, Shubo ;

Xing, Mengtao ;

Zhang, Shimin ;

Fu, Yihui ;

Wu, Jian ;

Zhang, Bihong ;

Xie, Lei .

INTERSPEECH 2020, 2020, :2472-2476

[8] Alias-and-Separate: Wideband Speech Coding Using Sub-Nyquist Sampling and Speech Separation [J].

Hwang, Soojoong ;

Lee, Eunkyun ;

Jang, Inseon ;

Shin, Jong Won .

IEEE SIGNAL PROCESSING LETTERS, 2022, 29 :2003-2007

[9]

Ioffe Sergey, 2015, Proceedings of Machine Learning Research, V37, P448

[10]

Ju Y., ICASSP 2023 2023 IEE, P2

← 1 2 3 4 →