REAL-TIME DENOISING AND DEREVERBERATION WTIH TINY RECURRENT U-NET

被引:37
|
作者
Choi, Hyeong-Seok [1 ,2 ]
Park, Sungjin [1 ]
Lee, Jie Hwan [2 ]
Heo, Hoon [2 ]
Jeon, Dongsuk [1 ]
Lee, Kyogu [1 ,2 ]
机构
[1] Seoul Natl Univ, Artificial Intelligence Inst, Dept Intelligence & Informat, Seoul, South Korea
[2] Supertone Inc, Canoga Pk, CA 91307 USA
关键词
real-time speech enhancement; lightweight network; denoising; dereverberation;
D O I
10.1109/ICASSP39728.2021.9414852
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Modern deep learning-based models have seen outstanding performance improvement with speech enhancement tasks. The number of parameters of state-of-the-art models, however, is often too large to be deployed on devices for real-world applications. To this end, we propose Tiny Recurrent U-Net (TRU-Net), a lightweight online inference model that matches the performance of current state-of-the-art models. The size of the quantized version of TRU-Net is 362 kilobytes, which is small enough to be deployed on edge devices. In addition, we combine the small-sized model with a new masking method called phase-aware beta-sigmoid mask, which enables simultaneous denoising and dereverberation. Results of both objective and subjective evaluations have shown that our model can achieve competitive performance with the current state-of-the-art models on benchmark datasets using fewer parameters by orders of magnitude.
引用
收藏
页码:5789 / 5793
页数:5
相关论文
共 50 条
  • [31] Recurrent U-Net for Resource-Constrained Segmentation
    Wang, Wei
    Yu, Kaicheng
    Hugonot, Joachim
    Fua, Pascal
    Salzmann, Mathieu
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2142 - 2151
  • [32] Application of recurrent U-net architecture to speech enhancement
    Grzywalski, Tomasz
    Drgas, Szymon
    2018 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA), 2018, : 82 - 87
  • [33] Recurrent residual U-Net for medical image segmentation
    Alom, Md Zahangir
    Yakopcic, Chris
    Hasan, Mahmudul
    Taha, Tarek M.
    Asari, Vijayan K.
    JOURNAL OF MEDICAL IMAGING, 2019, 6 (01)
  • [34] Errors in real-time room acoustics dereverberation
    Hatziantoniou, PD
    Mourjopoulos, JN
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2004, 52 (09): : 883 - 899
  • [35] Errors in real-time room acoustics dereverberation
    Hatziantoniou, P.D., 1600, Audio Engineering Society (52):
  • [36] RatUNet: residual U-Net based on attention mechanism for image denoising
    Zhang, Huibin
    Lian, Qiusheng
    Zhao, Jianmin
    Wang, Yining
    Yang, Yuchi
    Feng, Suqin
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [37] Image Denoising Based On Deep Feature Fusion And U-Net Network
    Zhang, Yong
    Journal of Applied Science and Engineering, 2025, 28 (10): : 2077 - 2085
  • [38] Denoising PET images for proton therapy using a residual U-net
    Sano, Akira
    Nishio, Teiji
    Masuda, Takamitsu
    Karasawa, Kumiko
    BIOMEDICAL PHYSICS & ENGINEERING EXPRESS, 2021, 7 (02)
  • [39] Image denoising based on Swin Transformer Residual Conv U-Net
    Gan, Yong
    Zhou, Shaohui
    Chen, Haonan
    Wang, Yuefeng
    27TH IEEE/ACIS INTERNATIONAL SUMMER CONFERENCE ON SOFTWARE ENGINEERING ARTIFICIAL INTELLIGENCE NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING, SNPD 2024-SUMMER, 2024, : 166 - 170
  • [40] MULTI-TASK ATTENTIONAL U-NET FOR HYPERSPECTRAL IMAGE DENOISING
    Xiong, Fengchao
    Gu, Zhongyi
    Zheng, Wenbin
    Li, Tianhan
    Zhou, Jun
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 7336 - 7339