Time-frequency dual-domain attention for acoustic echo cancellation

Cited by: 0
Authors
Yibo Huang [1]
Weidong Qin [2]
Zhiyong Li [1]
Qiuyu Zhang [2]
Affiliations
[1] Northwest Normal University, College of Physics and Electronic Engineering
[2] Queensland University of Technology, School of Mechanical, Medical and Process Engineering
[3] Lanzhou University of Technology, School of Computer and Communication
Keywords
Acoustic echo cancellation; Time-frequency dual-domain attention; Energy distribution; Dual-domain feature enhancement; Speech quality assessment
DOI
10.1007/s11227-025-07200-2
Abstract
Existing acoustic echo cancellation (AEC) technologies primarily focus on time-domain analysis, aiming to eliminate echo by modeling the long-range correlations of speech signals. However, these methods are limited in their ability to capture the dynamic variations in the frequency components of speech signals, and thus overlook the significance of frequency-domain information. To address this issue, this paper proposes an energy distribution analysis method based on time-frequency (T-F) representation. It introduces a dual-domain attention module (DDAM) that independently computes local importance weights in the frequency and time domains and multiplies these weights with the input features, so that the most important time-frequency features of speech signals are captured accurately. In addition, a dual-domain feature enhancement block (DDFEB), which combines DDAM with convolutional layers, further enhances the multilevel representation of the input features and is integrated into the encoder–decoder framework, improving the representation of time-frequency features. Experimental results show that the proposed method improves the perceptual evaluation of speech quality (PESQ) score by 17.65% compared to the existing F-T-LSTM method and achieves a short-time objective intelligibility (STOI) score of 0.93. Furthermore, the proposed method increases the mean opinion score (MOS) by 0.33 compared to the existing DTLN-aec method, demonstrating its superiority in enhancing the user experience.
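The weighting idea described in the abstract (separate importance weights along the frequency and time axes, each multiplied into the input features) can be illustrated with a minimal sketch. This is not the authors' DDAM: the mean pooling, the fixed sliding kernels, the sigmoid squashing, and the names `local_weights` and `dual_domain_attention` are all assumptions made for illustration, and a real module would learn its kernels inside a neural network.

```python
import math


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def local_weights(profile, kernel):
    """Slide an odd-length kernel over an edge-padded 1-D profile,
    then squash each local response into (0, 1)."""
    half = len(kernel) // 2
    padded = [profile[0]] * half + list(profile) + [profile[-1]] * half
    return [
        sigmoid(sum(k * padded[i + j] for j, k in enumerate(kernel)))
        for i in range(len(profile))
    ]


def dual_domain_attention(x, kernel_f, kernel_t):
    """x is a T-by-F grid: a list of time frames, each a list of frequency bins.

    Assumed scheme (hypothetical, for illustration only):
      1. average over frequency to get a per-frame profile,
      2. average over time to get a per-bin profile,
      3. turn each profile into local weights independently,
      4. multiply both sets of weights into the input.
    """
    n_t, n_f = len(x), len(x[0])
    time_profile = [sum(frame) / n_f for frame in x]
    freq_profile = [sum(x[t][f] for t in range(n_t)) / n_t for f in range(n_f)]
    w_t = local_weights(time_profile, kernel_t)  # one weight per time frame
    w_f = local_weights(freq_profile, kernel_f)  # one weight per frequency bin
    return [[x[t][f] * w_t[t] * w_f[f] for f in range(n_f)] for t in range(n_t)]


if __name__ == "__main__":
    features = [[0.2, 1.5, 0.1], [0.3, 2.0, 0.2], [0.1, 0.4, 0.1]]
    smooth = [1 / 3, 1 / 3, 1 / 3]  # hypothetical fixed kernel
    for row in dual_domain_attention(features, smooth, smooth):
        print([round(v, 3) for v in row])
```

Because the two weight vectors are computed independently and applied multiplicatively, a time-frequency cell keeps most of its magnitude only when both its frame and its frequency bin score as locally important.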
Related papers
14 in total
  • [1] Time-frequency dual-domain electromagnetic detection technology for buried pipelines
    Yao, Xiang
    Zhao, Chuntian
    Yao, Jin
    He, Zhanxiang
    Li, Hongmei
    NONDESTRUCTIVE TESTING AND EVALUATION, 2024, 39 (08) : 2354 - 2370
  • [2] Benthic nodal time-frequency dual-domain electromagnetic acquisition system and test
    Ren, Wenjing
    He, Zhanxiang
    Sun, Weibin
    Zhang, Dongyang
    Lu, Zhaoyang
    Lu, Yao
    Shiyou Diqiu Wuli Kantan/Oil Geophysical Prospecting, 2021, 56 (02) : 398 - 406
  • [3] Dual time-frequency domain system identification
    Aguero, Juan C.
    Tang, Wei
    Yuz, Juan I.
    Delgado, Ramon
    Goodwin, Graham C.
    AUTOMATICA, 2012, 48 (12) : 3031 - 3041
  • [4] An Interference Mitigation Method for FMCW Radar Based on Time-Frequency Distribution and Dual-Domain Fusion Filtering
    Zhou, Yu
    Cao, Ronggang
    Zhang, Anqi
    Li, Ping
    SENSORS, 2024, 24 (11)
  • [5] Dual Attention in Time and Frequency Domain for Voice Activity Detection
    Lee, Joohyung
    Jung, Youngmoon
    Kim, Hoirin
    INTERSPEECH 2020, 2020 : 3670 - 3674
  • [6] Channel estimation based on dual frequency domain Transformer in time-frequency doubly-selective fading underwater acoustic channels
    Cui, Xuerong
    Zhang, Chuang
    Li, Juan
    Jiang, Bin
    Li, Shibao
    Liu, Jianhang
    PHYSICAL COMMUNICATION, 2025, 68
  • [7] Identification of state-space systems using a dual time-frequency domain approach
    Agueero, Juan C.
    Yuz, Juan I.
    Goodwin, Graham C.
    Tang, Wei
    49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010 : 2863 - 2868
  • [8] Joint resampling algorithm for parallel dual feedback time-frequency domain symbol timing recovery
    Zhang P.
    Zhang N.
    Wang D.
    Wu T.
    Li Z.
    Gong F.
    Tongxin Xuebao/Journal on Communications, 2023, 44 (04) : 15 - 26
  • [9] Dual-branch time-frequency domain anti-interference method for ship radiated noise signal
    Duan, Yichen
    Shen, Xiaohong
    Wang, Haiyan
    OCEAN ENGINEERING, 2023, 279
  • [10] FreqFaceNet: an enhanced transformer architecture with dual-order frequency attention for deepfake detection
    Varun Gupta
    Vaibhav Srivastava
    Ankit Yadav
    Dinesh Kumar Vishwakarma
    Narendra Kumar
    Applied Intelligence, 2025, 55 (7)