DBNet: A Dual-branch Network Architecture Processing on Spectrum and Waveform for Single-channel Speech Enhancement

被引:5
|
作者
Zhang, Kanghao [1 ]
He, Shulin [1 ]
Li, Hao [1 ]
Zhang, Xueliang [1 ]
机构
[1] Inner Mongolia Univ, Coll Comp Sci, Hohhot, Peoples R China
来源
INTERSPEECH 2021 | 2021年
关键词
Speech enhancement; time domain; frequency domain; real-time; NEURAL-NETWORK; CNN;
D O I
10.21437/Interspeech.2021-1042
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
In real acoustic environment, speech enhancement is an arduous task to improve the quality and intelligibility of speech interfered by background noise and reverberation. Over the past years, deep learning has shown great potential on speech enhancement. In this paper, we propose a novel real-time framework called DBNet which is a dual-branch structure with alternate interconnection. Each branch incorporates an encoderdecoder architecture with skip connections. The two branches are responsible for spectrum and waveform modeling, respectively. A bridge layer is adopted to exchange information between the two branches. Systematic evaluation and comparison show that the proposed system substantially outperforms related algorithms under very challenging environments. And in INTERSPEECH 2021 Deep Noise Suppression (DNS) challenge, the proposed system ranks the top 8 in real-time track 1 in terms of the Mean Opinion Score (MOS) of the ITU-T P.835 framework.
引用
收藏
页码:2821 / 2825
页数:5
相关论文
共 50 条
  • [1] A Dual-branch Convolutional Network Architecture Processing on both Frequency and Time Domain for Single-channel Speech Enhancement
    Zhang, Kanghao
    He, Shulin
    Li, Hao
    Zhang, Xueliang
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2023, 12 (03)
  • [2] DUAL-BRANCH ATTENTION-IN-ATTENTION TRANSFORMER FOR SINGLE-CHANNEL SPEECH ENHANCEMENT
    Yu, Guochen
    Li, Andong
    Zheng, Chengshi
    Guo, Yinuo
    Wang, Yutian
    Wang, Hui
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7847 - 7851
  • [3] A Dual-Branch Interactive Fusion Network to Remove Artifacts From Single-Channel EEG
    Cui, Heng
    Li, Chang
    Liu, Aiping
    Qian, Ruobing
    Chen, Xun
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 12
  • [4] Phase Processing for Single-Channel Speech Enhancement
    Gerkmann, Timo
    Krawczyk-Becker, Martin
    Le Roux, Jonathan
    IEEE SIGNAL PROCESSING MAGAZINE, 2015, 32 (02) : 55 - 66
  • [5] DBNet: A Dual-Branch Network for Breast Cancer Classification in Ultrasound Images
    Meng, Hui
    Li, Qingfeng
    Liu, Xuefeng
    Wang, Yong
    Niu, Jianwei
    MEDICAL IMAGING 2022: COMPUTER-AIDED DIAGNOSIS, 2022, 12033
  • [6] Combine Waveform and Spectral Methods for Single-channel Speech Enhancement
    Li, Miao
    Zhang, Hui
    Zhang, Xueliang
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 47 - 52
  • [7] Single-Channel Speech Enhancement Using Double Spectrum
    Blass, Martin
    Mowlaee, Pejman
    Kleijn, W. Bastiaan
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1740 - 1744
  • [8] CompNet: Complementary network for single-channel speech enhancement
    Fan, Cunhang
    Zhang, Hongmei
    Li, Andong
    Xiang, Wang
    Zheng, Chengshi
    Lv, Zhao
    Wu, Xiaopei
    NEURAL NETWORKS, 2023, 168 : 508 - 517
  • [9] PFRNet: Dual-Branch Progressive Fusion Rectification Network for Monaural Speech Enhancement
    Yu, Runxiang
    Zhao, Ziwei
    Ye, Zhongfu
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2358 - 2362
  • [10] A Dual-Branch Speech Enhancement Model with Harmonic Repair
    Jia, Lizhen
    Xu, Yanyan
    Ke, Dengfeng
    APPLIED SCIENCES-BASEL, 2024, 14 (04):