PFRNet: Dual-Branch Progressive Fusion Rectification Network for Monaural Speech Enhancement

被引:12
作者
Yu, Runxiang [1 ,2 ]
Zhao, Ziwei [1 ,2 ]
Ye, Zhongfu [1 ,2 ]
机构
[1] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Hefei 230027, Anhui, Peoples R China
[2] Natl Engn Res Ctr Speech & Language Informat Proc, Hefei 230027, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Transformers; Speech enhancement; Tensors; Convolution; Decoding; Time-frequency analysis; Fusion rectification block; interactive time-frequency improved transformer; monaural speech enhancement;
D O I
10.1109/LSP.2022.3222045
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In recent years, the transformer-based dual-branch magnitude and complex spectrum estimation framework achieves state-of-the-art performance for monaural speech enhancement. However, the insufficient utilization of the interactive information in the middle layers makes each branch lack the ability of compensation and rectification. To address this problem, this letter proposes a novel dual-branch progressive fusion rectification network (PFRNet) for monaural speech enhancement. PFRNet is an encoder-decoder-based dual-branch structure with interactive improved real & complex transformers. In PFRNet, the fusion rectification block is proposed to convert the implicit relationship of the two branches into a fusion feature by the frequency-domain mutual attention mechanism. The fusion feature provides a platform for the interaction in the middle layers. The interactive time-frequency improved real & complex transformer can make better use of the long-term dependencies in the time-frequency domain. Experimental results show that the proposed PFRNet outperforms most advanced dual-branch speech enhancement approaches and previous advanced systems in terms of speech quality and intelligibility.
引用
收藏
页码:2358 / 2362
页数:5
相关论文
共 50 条
  • [21] A Multiscale Dual-Branch Feature Fusion and Attention Network for Hyperspectral Images Classification
    Gao, Hongmin
    Zhang, Yiyan
    Chen, Zhonghao
    Li, Chenming
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 8180 - 8192
  • [22] FUSION OF HYPERSPECTRAL AND LIDAR DATA BASED ON DUAL-BRANCH CONVOLUTIONAL NEURAL NETWORK
    Wang, Jinzhe
    Zhang, Junping
    Guo, Qingle
    Li, Tong
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 3388 - 3391
  • [23] A Dual-branch Convolutional Network Architecture Processing on both Frequency and Time Domain for Single-channel Speech Enhancement
    Zhang, Kanghao
    He, Shulin
    Li, Hao
    Zhang, Xueliang
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2023, 12 (03)
  • [24] WDFF-Net: Weighted Dual-Branch Feature Fusion Network for Polyp Segmentation With Object-Aware Attention Mechanism
    Cao, Jie
    Wang, Xin
    Qu, Zhiwei
    Zhuo, Li
    Li, Xiaoguang
    Zhang, Hui
    Yang, Yang
    Wei, Wei
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (07) : 4118 - 4131
  • [25] A Dual-Branch Multidomain Feature Fusion Network for Axial Super-Resolution in Optical Coherence Tomography
    Xu, Quanqing
    He, Xiang
    Xu, Muhao
    Hu, Kaixuan
    Song, Weiye
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 461 - 465
  • [26] A Recursive Network with Dynamic Attention for Monaural Speech Enhancement
    Li, Andong
    Zheng, Chengshi
    Fan, Cunhang
    Peng, Renhua
    Li, Xiaodong
    INTERSPEECH 2020, 2020, : 2422 - 2426
  • [27] DBMA-Net: A Dual-Branch Multiattention Network for Polyp Segmentation
    Zhai, Chenxu
    Yang, Lei
    Liu, Yanhong
    Yu, Hongnian
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 16
  • [28] Complex nested U-Net-based speech enhancement model using a dual-branch decoder
    Hwang, Seorim
    Park, Sung Wook
    Park, Youngcheol
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2024, 43 (02): : 253 - 259
  • [29] SpecMNet: Spectrum mend network for monaural speech enhancement
    Fan, Cunhang
    Zhang, Hongmei
    Yi, Jiangyan
    Lv, Zhao
    Tao, Jianhua
    Li, Taihao
    Pei, Guanxiong
    Wu, Xiaopei
    Li, Sheng
    APPLIED ACOUSTICS, 2022, 194
  • [30] Dual-Branch Subpixel-Guided Network for Hyperspectral Image Classification
    Han, Zhu
    Yang, Jin
    Gao, Lianru
    Zeng, Zhiqiang
    Zhang, Bing
    Chanussot, Jocelyn
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62