REAL-TIME DENOISING AND DEREVERBERATION WTIH TINY RECURRENT U-NET

被引:38
作者
Choi, Hyeong-Seok [1 ,2 ]
Park, Sungjin [1 ]
Lee, Jie Hwan [2 ]
Heo, Hoon [2 ]
Jeon, Dongsuk [1 ]
Lee, Kyogu [1 ,2 ]
机构
[1] Seoul Natl Univ, Artificial Intelligence Inst, Dept Intelligence & Informat, Seoul, South Korea
[2] Supertone Inc, Canoga Pk, CA 91307 USA
来源
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) | 2021年
关键词
real-time speech enhancement; lightweight network; denoising; dereverberation;
D O I
10.1109/ICASSP39728.2021.9414852
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Modern deep learning-based models have seen outstanding performance improvement with speech enhancement tasks. The number of parameters of state-of-the-art models, however, is often too large to be deployed on devices for real-world applications. To this end, we propose Tiny Recurrent U-Net (TRU-Net), a lightweight online inference model that matches the performance of current state-of-the-art models. The size of the quantized version of TRU-Net is 362 kilobytes, which is small enough to be deployed on edge devices. In addition, we combine the small-sized model with a new masking method called phase-aware beta-sigmoid mask, which enables simultaneous denoising and dereverberation. Results of both objective and subjective evaluations have shown that our model can achieve competitive performance with the current state-of-the-art models on benchmark datasets using fewer parameters by orders of magnitude.
引用
收藏
页码:5789 / 5793
页数:5
相关论文
共 32 条
  • [31] Xia YY, 2020, INT CONF ACOUST SPEE, P871, DOI [10.1109/ICASSP40776.2020.9054254, 10.1109/icassp40776.2020.9054254]
  • [32] Life Estimation of DC-Link Capacitor in Multi-operating Traction Drive System
    Yao, Bo
    Ge, Xinglai
    Shu, Lingzhou
    Wang, Huimin
    Hu, Zilang
    Gou, Bin
    [J]. 2019 10TH INTERNATIONAL CONFERENCE ON POWER ELECTRONICS AND ECCE ASIA (ICPE 2019 - ECCE ASIA), 2019, : 2743 - 2748