End-to-End Deep Learning-Based Adaptation Control for Linear Acoustic Echo Cancellation

被引：2

作者：

Haubner T. ^{[1
]}

Brendel A. ^{[1
,2
]}

Kellermann W. ^{[1
]}

机构：

[1] Friedrich-Alexander-Universität Erlangen-Nürnberg, Multimedia Communications and Signal Processing (LMS), Erlangen

[2] Fraunhofer Institute for Integrated Circuits (IIS), Erlangen

来源：

IEEE/ACM Transactions on Audio Speech and Language Processing | 2024年 / 32卷

关键词：

Acoustic echo cancellation; adaptation control; DNN; double-talk detection; step-size control; system identification;

D O I：

10.1109/TASLP.2023.3325923

中图分类号：

学科分类号：

摘要：

The attenuation of acoustic loudspeaker echoes remains to be one of the open challenges to achieve pleasant full-duplex hands free speech communication. In many modern signal enhancement interfaces, this problem is addressed by a linear acoustic echo canceler which subtracts a loudspeaker echo estimate from the recorded microphone signal. To obtain precise echo estimates, the parameters of the echo canceler, i.e., the filter coefficients, need to be estimated quickly and precisely from the observed loudspeaker and microphone signals. For this a sophisticated adaptation control is required to deal with high-power double-talk and rapidly track time-varying acoustic environments which are often faced with portable devices. In this paper, we address this problem by end-to-end deep learning. In particular, we suggest to infer the step-size for a least mean squares frequency-domain adaptive filter update by a Deep Neural Network (DNN). Two different step-size inference approaches are investigated. On the one hand broadband approaches, which use a single DNN to jointly infer step-sizes for all frequency bands, and on the other hand narrowband methods, which exploit individual DNNs per frequency band. The discussion of benefits and disadvantages of both approaches leads to a novel hybrid approach which shows improved echo cancellation while requiring only small DNN architectures. Furthermore, we investigate the effect of different loss functions, signal feature vectors, and DNN output layer architectures on the echo cancellation performance from which we obtain valuable insights into the general design and functionality of DNN-based adaptation control algorithms. © 2014 IEEE.

引用

页码：227 / 238

页数：11

共 51 条

[1] Hansler E., Schmidt G., Acoustic Echo and Noise Control: A Practical Approach., (2004)
[2] Enzner G., Buchner H., Favrot A., Kuech F., Acoustic echo control, Academic Press Library in Signal Processing, 4, pp. 807-877, (2014)
[3] Sridhar K., Et al., ICASSP 2021 acoustic echo cancellation challenge: Datasets, testing framework, and results, Proc. IEEE Int. Conf. Acoust. Speech Signal Process., pp. 151-155, (2021)
[4] Cutler R., Et al., Interspeech 2021 acoustic echo cancellation challenge, Proc. Interspeech, pp. 4748-4752, (2021)
[5] Cutler R., Et al., ICASSP 2022 acoustic echo cancellation challenge, Proc. IEEE Int. Conf. Acoust. Speech Signal Process., pp. 9107-9111, (2022)
[6] Haykin S., Adaptive Filter Theory, 4th Ed., (2002)
[7] Mader A., Puder H., Schmidt G.U., Step-size control for acoustic echo cancellation filters-An overview, Signal Process., 80, 9, pp. 1697-1719, (2000)
[8] Gansler T., Hansson M., Ivarsson C.-J., Salomonsson G., A doubletalk detector based on coherence, IEEE Trans. Commun., 44, 11, pp. 1421-1427, (1996)
[9] Benesty J., Morgan D.R., Cho J.H., A new class of doubletalk detectors based on cross-correlation, IEEE Speech Audio Process., 8, 2, pp. 168-172, (2000)
[10] Nitsch B.H., A frequency-selective stepfactor control for an adaptive filter algorithmworking in the frequency domain, Signal Process., 80, 9, pp. 1733-1745, (2000)

← 1 2 3 4 5 6 →