PFRNet: Dual-Branch Progressive Fusion Rectification Network for Monaural Speech Enhancement

被引:12
|
作者
Yu, Runxiang [1 ,2 ]
Zhao, Ziwei [1 ,2 ]
Ye, Zhongfu [1 ,2 ]
机构
[1] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Hefei 230027, Anhui, Peoples R China
[2] Natl Engn Res Ctr Speech & Language Informat Proc, Hefei 230027, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Transformers; Speech enhancement; Tensors; Convolution; Decoding; Time-frequency analysis; Fusion rectification block; interactive time-frequency improved transformer; monaural speech enhancement;
D O I
10.1109/LSP.2022.3222045
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In recent years, the transformer-based dual-branch magnitude and complex spectrum estimation framework achieves state-of-the-art performance for monaural speech enhancement. However, the insufficient utilization of the interactive information in the middle layers makes each branch lack the ability of compensation and rectification. To address this problem, this letter proposes a novel dual-branch progressive fusion rectification network (PFRNet) for monaural speech enhancement. PFRNet is an encoder-decoder-based dual-branch structure with interactive improved real & complex transformers. In PFRNet, the fusion rectification block is proposed to convert the implicit relationship of the two branches into a fusion feature by the frequency-domain mutual attention mechanism. The fusion feature provides a platform for the interaction in the middle layers. The interactive time-frequency improved real & complex transformer can make better use of the long-term dependencies in the time-frequency domain. Experimental results show that the proposed PFRNet outperforms most advanced dual-branch speech enhancement approaches and previous advanced systems in terms of speech quality and intelligibility.
引用
收藏
页码:2358 / 2362
页数:5
相关论文
共 50 条
  • [21] Dual-Branch Multimodal Fusion Network for Driver Facial Emotion Recognition
    Wang, Le
    Chang, Yuchen
    Wang, Kaiping
    APPLIED SCIENCES-BASEL, 2024, 14 (20):
  • [22] Dual branch deep interactive UNet for monaural noisy-reverberant speech enhancement
    Zhang, Zehua
    Xu, Shiyun
    Zhuang, Xuyi
    Qian, Yukun
    Wang, Mingjiang
    APPLIED ACOUSTICS, 2023, 212
  • [23] Inverseformer: Dual-Branch Network With Attention Enhancement for Density Interface Inversion
    Yu, Ping
    Zhou, Long-Ran
    Zhao, Xiao
    Lu, Peng-Yu
    Huang, Guan-Lin
    Jiao, Jian
    Bi, Feng-Yi
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [24] End-to-End Dual-Branch Network Towards Synthetic Speech Detection
    Ma, Kaijie
    Feng, Yifan
    Chen, Beijing
    Zhao, Guoying
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 359 - 363
  • [25] DUAL-BRANCH ATTENTION-IN-ATTENTION TRANSFORMER FOR SINGLE-CHANNEL SPEECH ENHANCEMENT
    Yu, Guochen
    Li, Andong
    Zheng, Chengshi
    Guo, Yinuo
    Wang, Yutian
    Wang, Hui
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7847 - 7851
  • [26] Dual-branch fusion model for lensless imaging
    Zhang, Yinger
    Wu, Zhouyi
    Xu, Yunhui
    Huangfu, Jiangtao
    OPTICS EXPRESS, 2023, 31 (12) : 19463 - 19477
  • [27] A Multiscale Dual-Branch Feature Fusion and Attention Network for Hyperspectral Images Classification
    Gao, Hongmin
    Zhang, Yiyan
    Chen, Zhonghao
    Li, Chenming
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 8180 - 8192
  • [28] Source camera identification based on an adaptive dual-branch fusion residual network
    Hong Zheng
    Changhui You
    Tianyu Wang
    Jianping Ju
    Xi Li
    Multimedia Tools and Applications, 2024, 83 : 18479 - 18495
  • [29] DBIF: Dual-Branch Feature Extraction Network for Infrared and Visible Image Fusion
    Zhang, Haozhe
    Cui, Rongpu
    Zheng, Zhuohang
    Gao, Shaobing
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VIII, 2025, 15038 : 309 - 323
  • [30] Source camera identification based on an adaptive dual-branch fusion residual network
    Zheng, Hong
    You, Changhui
    Wang, Tianyu
    Ju, Jianping
    Li, Xi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (06) : 18479 - 18495