REAL-TIME JOINT NOISE SUPPRESSION AND BANDWIDTH EXTENSION OF NOISY REVERBERANT WIDEBAND SPEECH

被引:0
|
作者
Gomez, Esteban [1 ,2 ]
Backstrom, Tom [1 ]
机构
[1] Aalto Univ, Dept Informat & Commun Engn, Espoo, Finland
[2] Voicemod Inc, Valencia, Spain
来源
2024 18TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT, IWAENC 2024 | 2024年
关键词
Bandwidth extension; noise suppression; real-time; deep learning; multitasking; PERCEPTION;
D O I
10.1109/IWAENC61483.2024.10694458
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Artificially extending the bandwidth of speech in real-time applications that are band-limited to 16 kHz (known as wide-band) or lower sample rates such as VoIP or communication over Bluetooth, can significantly improve its perceptual quality. Typically, dry clean speech is assumed as input to estimate the missing spectral information. However, such an assumption falls short if the input speech is reverberant or has been contaminated by noise, resulting in audible artifacts. We propose a real-time low-complexity multitasking neural network capable of performing noise suppression and bandwidth extension from 16 kHz to 48 kHz (fullband) on a CPU, preventing such issues even if the noise cannot be completely removed from the input. Instead of employing a monolithic model, we adopt a modular approach and complexity reduction methods that result in a more compact model than the sum of its parts while improving its performance.
引用
收藏
页码:6 / 10
页数:5
相关论文
共 50 条
  • [1] Dual-Signal Transformation LSTM Network for Real-Time Noise Suppression
    Westhausen, Nils L.
    Meyer, Bernd T.
    INTERSPEECH 2020, 2020, : 2477 - 2481
  • [2] An efficient joint training model for monaural noisy-reverberant speech recognition
    Lian, Xiaoyu
    Xia, Nan
    Dai, Gaole
    Yang, Hongqin
    APPLIED ACOUSTICS, 2025, 228
  • [3] A real-time blind source separation scheme and its application to reverberant and noisy acoustic environments
    Aichner, R
    Buchner, H
    Yan, F
    Kellermann, W
    SIGNAL PROCESSING, 2006, 86 (06) : 1260 - 1277
  • [4] Real-time pre-processing for improved feature extraction of noisy speech
    P. P. Raj
    International Journal of Speech Technology, 2021, 24 : 715 - 728
  • [5] Real-time pre-processing for improved feature extraction of noisy speech
    Raj, P. P.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 24 (03) : 715 - 728
  • [6] Toward Real-Time Noise Suppression for Acceleration Sensors on Resource- Constrained Processors
    Tehrani, Yas Hosseini
    Atarodi, Seyed Mojtaba
    IEEE SENSORS JOURNAL, 2024, 24 (24) : 42245 - 42254
  • [7] HLA real-time extension
    Zhao, H
    Georganas, ND
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2004, 16 (15) : 1503 - 1525
  • [8] DeepVQE: Real Time Deep Voice Quality Enhancement for Joint Acoustic Echo Cancellation, Noise Suppression and Dereverberation
    Indenbom, Evgenii
    Ristea, Nicolae-Catalin
    Saabas, Ando
    Parnamaa, Tanel
    Guzvin, Jegor
    Cutler, Ross
    INTERSPEECH 2023, 2023, : 3819 - 3823
  • [9] A Real-time Extension to the Android Platform
    Kalkov, Igor
    Franke, Dominik
    Schommer, John F.
    Kowalewski, Stefan
    PROCEEDINGS OF THE 10TH INTERNATIONAL WORKSHOP ON JAVA TECHNOLOGIES FOR REAL-TIME AND EMBEDDED SYSTEMS, 2012, : 105 - 114
  • [10] The Recognition of Whispered Speech in Real-Time
    Hendrickson, Kristi
    Ernest, Danielle
    EAR AND HEARING, 2022, 43 (02) : 554 - 562