REAL-TIME DENOISING AND DEREVERBERATION WTIH TINY RECURRENT U-NET

被引:37
|
作者
Choi, Hyeong-Seok [1 ,2 ]
Park, Sungjin [1 ]
Lee, Jie Hwan [2 ]
Heo, Hoon [2 ]
Jeon, Dongsuk [1 ]
Lee, Kyogu [1 ,2 ]
机构
[1] Seoul Natl Univ, Artificial Intelligence Inst, Dept Intelligence & Informat, Seoul, South Korea
[2] Supertone Inc, Canoga Pk, CA 91307 USA
关键词
real-time speech enhancement; lightweight network; denoising; dereverberation;
D O I
10.1109/ICASSP39728.2021.9414852
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Modern deep learning-based models have seen outstanding performance improvement with speech enhancement tasks. The number of parameters of state-of-the-art models, however, is often too large to be deployed on devices for real-world applications. To this end, we propose Tiny Recurrent U-Net (TRU-Net), a lightweight online inference model that matches the performance of current state-of-the-art models. The size of the quantized version of TRU-Net is 362 kilobytes, which is small enough to be deployed on edge devices. In addition, we combine the small-sized model with a new masking method called phase-aware beta-sigmoid mask, which enables simultaneous denoising and dereverberation. Results of both objective and subjective evaluations have shown that our model can achieve competitive performance with the current state-of-the-art models on benchmark datasets using fewer parameters by orders of magnitude.
引用
收藏
页码:5789 / 5793
页数:5
相关论文
共 50 条
  • [1] A Subconvolutional U-net with Gated Recurrent Unit and Efficient Channel Attention Mechanism for Real-Time Speech Enhancement
    Yechuri, Sivaramakrishna
    Vanambathina, Sunnydayal
    WIRELESS PERSONAL COMMUNICATIONS, 2024,
  • [2] U-Net for SPECT Image Denoising
    Reymann, Maximilian P.
    Wuerfl, Tobias
    Ritt, Philipp
    Stimpel, Bernhard
    Cachovan, Michal
    Vija, A. Hans
    Maier, Andreas
    2019 IEEE NUCLEAR SCIENCE SYMPOSIUM AND MEDICAL IMAGING CONFERENCE (NSS/MIC), 2019,
  • [3] Real-Time Attentive Dilated U-Net for Extremely Dark Image Enhancement
    Huang, Junjian
    Ren, Hao
    Liu, Shulin
    Liu, Yong
    Lv, Chuanlu
    Lu, Jiawen
    Xie, Changyong
    Lu, Hong
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (08)
  • [4] U-Net enhanced real-time LED-based photoacoustic imaging
    Paul, Avijit
    Mallidi, Srivalleesha
    JOURNAL OF BIOPHOTONICS, 2024, 17 (06)
  • [5] Real-time Water Area Segmentation for USV using Enhanced U-Net
    Ling, Gui
    Suo, Feiyang
    Lin, Zhen
    Li, Yanjun
    Xiang, Ji
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 2533 - 2538
  • [6] Spectro-Temporal SubNet for Real-Time Monaural Speech Denoising and Dereverberation
    Xiong, Feifei
    Chen, Weiguang
    Wang, Pengyu
    Li, Xiaofei
    Feng, Jinwei
    INTERSPEECH 2022, 2022, : 931 - 935
  • [7] Seismic Signal Denoising using U-Net in the Time-Frequency Domain
    Chirtu, Mihail-Antonio
    Radoi, Anamaria
    2022 45TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING, TSP, 2022, : 6 - 10
  • [8] Medical Image Denoising with Recurrent Residual U-Net (R2U-Net) base Auto-Encoder
    Nasrin, Shamima
    Alom, Md Zahangir
    Burada, Ranga
    Taha, Tarek M.
    Asari, Vijayan K.
    PROCEEDINGS OF THE 2019 IEEE NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE (NAECON), 2019, : 345 - 350
  • [9] Real-Time ConvNext-Based U-Net with Feature Infusion for Egg Microcrack Detection
    Shi, Chenbo
    Li, Yuejia
    Jiang, Xin
    Sun, Wenxin
    Zhu, Changsheng
    Mo, Yuanzheng
    Yan, Shaojia
    Zhang, Chun
    AGRICULTURE-BASEL, 2024, 14 (09):
  • [10] In-situ autocalibrated electrospinning process via U-Net based real-time monitoring
    Kim, Yeong-Seo
    Hyun, Goan-Woo
    Park, Suk-Hee
    JOURNAL OF MANUFACTURING PROCESSES, 2025, 137 : 397 - 407