REAL-TIME JOINT NOISE SUPPRESSION AND BANDWIDTH EXTENSION OF NOISY REVERBERANT WIDEBAND SPEECH

Cited by: 0
Authors
Gomez, Esteban [1, 2]
Backstrom, Tom [1]
Affiliations
[1] Aalto Univ, Dept Informat & Commun Engn, Espoo, Finland
[2] Voicemod Inc, Valencia, Spain
Source
2024 18TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT, IWAENC 2024 | 2024
Keywords
Bandwidth extension; noise suppression; real-time; deep learning; multitasking; PERCEPTION
DOI
10.1109/IWAENC61483.2024.10694458
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
Artificially extending the bandwidth of speech in real-time applications limited to a 16 kHz sample rate (known as wideband) or lower, such as VoIP or communication over Bluetooth, can significantly improve its perceptual quality. Typically, dry, clean speech is assumed as input when estimating the missing spectral information. However, this assumption falls short if the input speech is reverberant or has been contaminated by noise, resulting in audible artifacts. We propose a real-time, low-complexity multitasking neural network capable of performing noise suppression and bandwidth extension from 16 kHz to 48 kHz (fullband) on a CPU, preventing such issues even if the noise cannot be completely removed from the input. Instead of employing a monolithic model, we adopt a modular approach and complexity reduction methods that result in a model more compact than the sum of its parts while improving its performance.
Pages: 6-10
Page count: 5