A deep complex multi-frame filtering network for stereophonic acoustic echo cancellation

被引:1
作者
Cheng, Linjuan [1 ,2 ,3 ]
Zheng, Chengshi [1 ,3 ]
Li, Andong [1 ,3 ]
Wu, Yuquan [2 ,3 ]
Peng, Renhua [1 ,3 ]
Li, Xiaodong [1 ,3 ]
机构
[1] Chinese Acad Sci, Inst Acoust, Key Lab Noise & Vibrat Res, Beijing, Peoples R China
[2] Chinese Acad Sci, Sci & Technol Integrated Infomat Syst Lab, Inst Software, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
来源
INTERSPEECH 2022 | 2022年
关键词
deep learning; stereophonic acoustic echo cancellation; multi-stage; SUPPRESSION; DATABASE; NOISE;
D O I
10.21437/Interspeech.2022-669
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In hands-free communication system, the coupling between loudspeaker and microphone generates echo signal, which can severely influence the quality of communication. Meanwhile, various types of noise in communication environments further reduce speech quality and intelligibility. It is difficult to extract the near-end signal from the microphone signal within one step, especially in low signal-to-noise ratio scenarios. In this paper, we propose a deep complex network approach to address this issue. Specially, we decompose the stereophonic acoustic echo cancellation into two stages, including linear stereophonic acoustic echo cancellation module and residual echo suppression module, where both modules are based on deep learning architectures. A multi-frame filtering strategy is introduced to benefit the estimation of linear echo by capturing more interframe information. Moreover, we decouple the complex spectral mapping into magnitude estimation and complex spectrum refinement. Experimental results demonstrate that our proposed approach achieves stage-of-the-art performance over previous advanced algorithms under various conditions.
引用
收藏
页码:2508 / 2512
页数:5
相关论文
共 29 条
[1]   IMAGE METHOD FOR EFFICIENTLY SIMULATING SMALL-ROOM ACOUSTICS [J].
ALLEN, JB ;
BERKLEY, DA .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (04) :943-950
[2]  
Amand F., 1996, EUR SIGN PROC C 1996, P1
[3]   A frequency domain stereophonic acoustic echo canceler exploiting the coherence between the channels [J].
Benesty, J ;
Gilloire, A .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 106 (03) :L30-L35
[4]  
BENESTY J., 2001, ADV NETWORK ACOUSTIC
[5]  
Benesty J., 1997, IEEE INT C AC
[6]  
Carbajal G, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P231, DOI 10.1109/ICASSP.2018.8461476
[7]  
Cheng L., 2021, ACTA ACUSTICA
[8]   Deep learning-based stereophonic acoustic echo suppression without decorrelation [J].
Cheng, Linjuan ;
Peng, Renhua ;
Li, Andong ;
Zheng, Chengshi ;
Li, Xiaodong .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2021, 150 (02) :816-829
[9]  
Dean D, 2010, 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, P3110
[10]  
Enzner G., 2001, P IWAENC 2001 DARMST