PFRNet: Dual-Branch Progressive Fusion Rectification Network for Monaural Speech Enhancement

被引:12
作者
Yu, Runxiang [1 ,2 ]
Zhao, Ziwei [1 ,2 ]
Ye, Zhongfu [1 ,2 ]
机构
[1] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Hefei 230027, Anhui, Peoples R China
[2] Natl Engn Res Ctr Speech & Language Informat Proc, Hefei 230027, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Transformers; Speech enhancement; Tensors; Convolution; Decoding; Time-frequency analysis; Fusion rectification block; interactive time-frequency improved transformer; monaural speech enhancement;
D O I
10.1109/LSP.2022.3222045
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In recent years, the transformer-based dual-branch magnitude and complex spectrum estimation framework achieves state-of-the-art performance for monaural speech enhancement. However, the insufficient utilization of the interactive information in the middle layers makes each branch lack the ability of compensation and rectification. To address this problem, this letter proposes a novel dual-branch progressive fusion rectification network (PFRNet) for monaural speech enhancement. PFRNet is an encoder-decoder-based dual-branch structure with interactive improved real & complex transformers. In PFRNet, the fusion rectification block is proposed to convert the implicit relationship of the two branches into a fusion feature by the frequency-domain mutual attention mechanism. The fusion feature provides a platform for the interaction in the middle layers. The interactive time-frequency improved real & complex transformer can make better use of the long-term dependencies in the time-frequency domain. Experimental results show that the proposed PFRNet outperforms most advanced dual-branch speech enhancement approaches and previous advanced systems in terms of speech quality and intelligibility.
引用
收藏
页码:2358 / 2362
页数:5
相关论文
共 50 条
  • [41] Dilated convolutional recurrent neural network for monaural speech enhancement
    Pirhosseinloo, Shadi
    Brumberg, Jonathan S.
    CONFERENCE RECORD OF THE 2019 FIFTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2019, : 158 - 162
  • [42] PAN: PHONEME-AWARE NETWORK FOR MONAURAL SPEECH ENHANCEMENT
    Du, Zhihao
    Lei, Ming
    Han, Jiqing
    Zhang, Shiliang
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6634 - 6638
  • [43] REDUNDANT CONVOLUTIONAL NETWORK WITH ATTENTION MECHANISM FOR MONAURAL SPEECH ENHANCEMENT
    Lan, Tian
    Lyu, Yilan
    Hui, Guoqiang
    Mokhosi, Refuoe
    Li, Sen
    Liu, Qiao
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6654 - 6658
  • [44] Multi-stage attention network for monaural speech enhancement
    Wang, Kunpeng
    Lu, Wenjing
    Liu, Peng
    Yao, Juan
    Li, Huafeng
    IET SIGNAL PROCESSING, 2023, 17 (03)
  • [45] Highway Visibility Level Prediction Using Geometric and Visual Features Driven Dual-Branch Fusion Network
    Sun, Yubao
    Tang, Jihui
    Liu, Qingshan
    Zhang, Zhendong
    Huang, Liang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (08) : 9992 - 10004
  • [46] DHUnet: Dual-branch hierarchical global-local fusion network for whole slide image segmentation
    Wang, Lian
    Pan, Liangrui
    Wang, Hetian
    Liu, Mingting
    Feng, Zhichao
    Rong, Pengfei
    Chen, Zuo
    Peng, Shaoliang
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 85
  • [47] Coarse-to-Fine Lung Nodule Segmentation in CT Images With Image Enhancement and Dual-Branch Network
    Wu, Zhitong
    Zhou, Qianjun
    Wang, Feng
    IEEE ACCESS, 2021, 9 (09): : 7255 - 7262
  • [48] TrUNet: Dual-Branch Network by Fusing CNN and Transformer for Skin Lesion Segmentation
    Chen, Wei
    Mu, Qian
    Qi, Jie
    IEEE ACCESS, 2024, 12 : 144174 - 144185
  • [49] LG-DBNet: Local and Global Dual-Branch Network for SAR Image Denoising
    Liu, Shuaiqi
    Tian, Shikang
    Zhao, Yuhang
    Hu, Qi
    Li, Bing
    Zhang, Yu-Dong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [50] Efficient Dual-Branch Information Interaction Network for Lightweight Image Super-Resolution
    Jin, Haonan
    Gao, Guangwei
    Li, Juncheng
    Guo, Zhenhua
    Yu, Yi
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73