PFRNet: Dual-Branch Progressive Fusion Rectification Network for Monaural Speech Enhancement

被引:12
作者
Yu, Runxiang [1 ,2 ]
Zhao, Ziwei [1 ,2 ]
Ye, Zhongfu [1 ,2 ]
机构
[1] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Hefei 230027, Anhui, Peoples R China
[2] Natl Engn Res Ctr Speech & Language Informat Proc, Hefei 230027, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Transformers; Speech enhancement; Tensors; Convolution; Decoding; Time-frequency analysis; Fusion rectification block; interactive time-frequency improved transformer; monaural speech enhancement;
D O I
10.1109/LSP.2022.3222045
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In recent years, the transformer-based dual-branch magnitude and complex spectrum estimation framework achieves state-of-the-art performance for monaural speech enhancement. However, the insufficient utilization of the interactive information in the middle layers makes each branch lack the ability of compensation and rectification. To address this problem, this letter proposes a novel dual-branch progressive fusion rectification network (PFRNet) for monaural speech enhancement. PFRNet is an encoder-decoder-based dual-branch structure with interactive improved real & complex transformers. In PFRNet, the fusion rectification block is proposed to convert the implicit relationship of the two branches into a fusion feature by the frequency-domain mutual attention mechanism. The fusion feature provides a platform for the interaction in the middle layers. The interactive time-frequency improved real & complex transformer can make better use of the long-term dependencies in the time-frequency domain. Experimental results show that the proposed PFRNet outperforms most advanced dual-branch speech enhancement approaches and previous advanced systems in terms of speech quality and intelligibility.
引用
收藏
页码:2358 / 2362
页数:5
相关论文
共 50 条
  • [1] Scale-aware dual-branch complex convolutional recurrent network for monaural speech enhancement
    Li, Yihao
    Sun, Meng
    Zhang, Xiongwei
    Van Hamme, Hugo
    COMPUTER SPEECH AND LANGUAGE, 2024, 86
  • [2] DBT-Net: Dual-Branch Federative Magnitude and Phase Estimation With Attention-in-Attention Transformer for Monaural Speech Enhancement
    Yu, Guochen
    Li, Andong
    Wang, Hui
    Wang, Yutian
    Ke, Yuxuan
    Zheng, Chengshi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2629 - 2644
  • [3] A Dual-Branch Fusion Network for Surgical Instrument Segmentation
    Yang, Lei
    Zhai, Chenxu
    Wang, Hongyong
    Liu, Yanhong
    Bian, Guibin
    IEEE TRANSACTIONS ON MEDICAL ROBOTICS AND BIONICS, 2024, 6 (04): : 1542 - 1554
  • [4] Inverseformer: Dual-Branch Network With Attention Enhancement for Density Interface Inversion
    Yu, Ping
    Zhou, Long-Ran
    Zhao, Xiao
    Lu, Peng-Yu
    Huang, Guan-Lin
    Jiao, Jian
    Bi, Feng-Yi
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [5] A Dual-Branch Speech Enhancement Model with Harmonic Repair
    Jia, Lizhen
    Xu, Yanyan
    Ke, Dengfeng
    APPLIED SCIENCES-BASEL, 2024, 14 (04):
  • [6] Dual-Branch Network for Cloud and Cloud Shadow Segmentation
    Lu, Chen
    Xia, Min
    Qian, Ming
    Chen, Binyu
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [7] Parallel Dual-Branch Polyp Segmentation Network
    Sun, Kunjie
    Cheng, Li
    Yuan, Haiwen
    Li, Xuan
    IEEE ACCESS, 2024, 12 : 192051 - 192061
  • [8] Ship Recognition for Complex SAR Images via Dual-Branch Transformer Fusion Network
    Sun, Zhongzhen
    Leng, Xiangguang
    Zhang, Xianghui
    Xiong, Boli
    Ji, Kefeng
    Kuang, Gangyao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [9] Dual-Branch Modeling Based on State-Space Model for Speech Enhancement
    Sun, Linhui
    Yuan, Shuo
    Gong, Aifei
    Ye, Lei
    Chng, Eng Siong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1457 - 1467
  • [10] LMDF-Net: Layered Multi-Scale Dual-Branch Dual-Temporal Fusion Network for Medical Image Segmentation
    Ye, Siyuan
    Wei, Yan
    IEEE ACCESS, 2024, 12 : 183078 - 183088