DBNet: A Dual-branch Network Architecture Processing on Spectrum and Waveform for Single-channel Speech Enhancement

Cited by: 5

Authors:

Zhang, Kanghao [1]

He, Shulin [1]

Li, Hao [1]

Zhang, Xueliang [1]

Affiliations:

[1] Inner Mongolia University, College of Computer Science, Hohhot, People's Republic of China

Source:

INTERSPEECH 2021 | 2021

Keywords:

Speech enhancement; time domain; frequency domain; real-time; neural network; CNN

DOI:

10.21437/Interspeech.2021-1042

Chinese Library Classification (CLC) number:

R36 [Pathology]; R76 [Otorhinolaryngology]

Discipline classification codes:

100104; 100213

Abstract:

In real acoustic environments, speech enhancement is the arduous task of improving the quality and intelligibility of speech degraded by background noise and reverberation. In recent years, deep learning has shown great potential for speech enhancement. In this paper, we propose a novel real-time framework called DBNet, a dual-branch structure with alternate interconnection. Each branch incorporates an encoder-decoder architecture with skip connections. The two branches are responsible for spectrum and waveform modeling, respectively. A bridge layer is adopted to exchange information between the two branches. Systematic evaluation and comparison show that the proposed system substantially outperforms related algorithms under very challenging environments. In the INTERSPEECH 2021 Deep Noise Suppression (DNS) challenge, the proposed system ranks in the top 8 of real-time Track 1 in terms of the Mean Opinion Score (MOS) of the ITU-T P.835 framework.
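The abstract does not say how the bridge layer exchanges information between the two branches. As a rough illustration only, the sketch below assumes one plausible reading: each branch works on framed features in its own domain, and a bridge maps the other branch's features across domains with an FFT or inverse FFT before mixing them. The function names (`frame_signal`, `to_spectrum`, `to_waveform`, `bridge`), the frame sizes, and the equal-weight averaging are all hypothetical and are not taken from the paper; the learned encoder-decoder layers are omitted entirely.

```python
import numpy as np


def frame_signal(x, frame_len, hop):
    """Slice a 1-D waveform into overlapping frames of shape (n_frames, frame_len)."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]


def to_spectrum(frames):
    """Waveform-domain frames -> complex one-sided spectrum per frame."""
    return np.fft.rfft(frames, axis=-1)


def to_waveform(spec, frame_len):
    """Complex one-sided spectrum per frame -> waveform-domain frames."""
    return np.fft.irfft(spec, n=frame_len, axis=-1)


def bridge(wave_feat, spec_feat, frame_len):
    """Hypothetical bridge (an assumption, not the paper's layer).

    Each branch receives the other branch's features, converted into its
    own domain, and averages them with its own features.
    """
    wave_out = 0.5 * (wave_feat + to_waveform(spec_feat, frame_len))
    spec_out = 0.5 * (spec_feat + to_spectrum(wave_feat))
    return wave_out, spec_out


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    noisy = rng.standard_normal(1600)          # stand-in for a noisy waveform
    wave_feat = frame_signal(noisy, 320, 160)  # waveform-branch input
    spec_feat = to_spectrum(wave_feat)         # spectrum-branch input
    wave_out, spec_out = bridge(wave_feat, spec_feat, 320)
    print(wave_out.shape, spec_out.shape)
```

In a real system the two feature streams would come from separate learned encoders and would generally disagree, which is what would make the exchange useful; here they are derived from the same frames only to keep the sketch self-contained.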

Pages: 2821-2825

Page count: 5
