A Nested U-Net with Efficient Channel Attention and D3Net for Speech Enhancement

被引:0
|
作者
Sivaramakrishna Yechuri
Sunnydayal Vanambathina
机构
[1] VIT-AP University,SENSE
关键词
Time domain; Multi-scale feature extraction block; D3Net; Efficient channel attention; Cross-channel interaction;
D O I
暂无
中图分类号
学科分类号
摘要
The advanced improvements in deep learning neural networks in the speech enhancement area have vastly improved. The performance of speech enhancement is still limited because widely used existing techniques cannot fully exploit contextual information from multiple scales. To address this issue, we propose a nested U-Net with efficient channel attention and D3Net (ECAD3MUNet) for speech enhancement. The proposed ECAD3MUNet is an encoder and decoder model with skip connections to improve information flow. In ECAD3MUNet, a novel densely connected dilated DenseNet (D3Net) block is incorporated with a multi-scale feature extraction block to explore large-scale contextual information. In this way, the benefits of local and global features can be completely leveraged to increase speech reconstruction abilities. D3Net uses revolutionary multi-dilated convolution with a variable dilation factor in a single layer to simulate many resolutions at the same time. D3Net improves the growth of a receptive field and the simultaneous modeling of multi-resolution data in a single convolution layer. D3Net addresses the aliasing problem that occurs when we naively include dilated convolution in the DenseNet model. Additionally, a novel cross-channel interaction can be implemented via the efficient channel attention (ECA) module without dimensionality reduction. In module testing, choosing an adaptable kernel size for the ECA improved network performance significantly. We incorporated the D3Net and ECA modules into the proposed model for better feature extraction and utterance-level context aggregation. The proposed ECAD3MUNet model experimental results outperform other baseline models in objective speech quality and intelligibility scores.
引用
收藏
页码:4051 / 4071
页数:20
相关论文
共 50 条
  • [1] A Nested U-Net with Efficient Channel Attention and D3Net for Speech Enhancement
    Yechuri, Sivaramakrishna
    Vanambathina, Sunnydayal
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2023, 42 (07) : 4051 - 4071
  • [2] CHANNEL-ATTENTION DENSE U-NET FOR MULTICHANNEL SPEECH ENHANCEMENT
    Tolooshams, Bahareh
    Giri, Ritwik
    Song, Andrew H.
    Isik, Umut
    Krishnaswamy, Arvindh
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 836 - 840
  • [3] A Nested U-Net With Self-Attention and Dense Connectivity for Monaural Speech Enhancement
    Xiang, Xiaoxiao
    Zhang, Xiaojuan
    Chen, Haozhe
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 105 - 109
  • [4] Single channel speech enhancement using time-frequency attention mechanism based nested U-net model
    Prathipati, Anil Kumar
    Chakravarthy, A. S. N.
    ENGINEERING RESEARCH EXPRESS, 2024, 6 (03):
  • [5] Stacked U-Net with Time-Frequency Attention and Deep Connection Net for Single Channel Speech Enhancement
    Parisae, Veeraswamy
    Bhavanam, S. Nagakishore
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2024,
  • [6] Shuffle Attention U-Net for Speech Enhancement in Time Domain
    Jannu, Chaitanya
    Vanambathina, Sunny Dayal
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2024, 24 (04)
  • [7] A Subconvolutional U-net with Gated Recurrent Unit and Efficient Channel Attention Mechanism for Real-Time Speech Enhancement
    Yechuri, Sivaramakrishna
    Vanambathina, Sunnydayal
    WIRELESS PERSONAL COMMUNICATIONS, 2024,
  • [8] ECAU-Net: Efficient channel attention U-Net for fetal ultrasound cerebellum segmentation
    Shu, Xin
    Chang, Feng
    Zhang, Xin
    Shao, Changbin
    Yang, Xibei
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 75
  • [9] ECAU-Net: Efficient channel attention U-Net for fetal ultrasound cerebellum segmentation
    Shu, Xin
    Chang, Feng
    Zhang, Xin
    Shao, Changbin
    Yang, Xibei
    Biomedical Signal Processing and Control, 2022, 75
  • [10] SCU-Net plus plus : A Nested U-Net Based on Sharpening Filter and Channel Attention Mechanism
    Cui, Hu
    Pan, Haiwei
    Zhang, Kejia
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022