REAL-TIME JOINT NOISE SUPPRESSION AND BANDWIDTH EXTENSION OF NOISY REVERBERANT WIDEBAND SPEECH

被引：0

作者：

Gomez, Esteban ^{[1
,2
]}

Backstrom, Tom ^{[1
]}

机构：

[1] Aalto Univ, Dept Informat & Commun Engn, Espoo, Finland

[2] Voicemod Inc, Valencia, Spain

来源：

2024 18TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT, IWAENC 2024 | 2024年

关键词：

Bandwidth extension; noise suppression; real-time; deep learning; multitasking; PERCEPTION;

D O I：

10.1109/IWAENC61483.2024.10694458

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Artificially extending the bandwidth of speech in real-time applications that are band-limited to 16 kHz (known as wide-band) or lower sample rates such as VoIP or communication over Bluetooth, can significantly improve its perceptual quality. Typically, dry clean speech is assumed as input to estimate the missing spectral information. However, such an assumption falls short if the input speech is reverberant or has been contaminated by noise, resulting in audible artifacts. We propose a real-time low-complexity multitasking neural network capable of performing noise suppression and bandwidth extension from 16 kHz to 48 kHz (fullband) on a CPU, preventing such issues even if the noise cannot be completely removed from the input. Instead of employing a monolithic model, we adopt a modular approach and complexity reduction methods that result in a more compact model than the sum of its parts while improving its performance.

引用

页码：6 / 10

页数：5

共 50 条

[31] Design and implementation of a real-time image noise canceller
Ma, J
Huang, XM
VISUAL INFORMATION PROCESSING XIII, 2004, 5438 : 273 - 281
[32] A multitask joint framework for real-time person search
Li, Ye
Yin, Kangning
Liang, Jie
Tan, Zhuofu
Wang, Xinzhong
Yin, Guangqiang
Wang, Zhiguo
MULTIMEDIA SYSTEMS, 2023, 29 (01) : 211 - 222
[33] A multitask joint framework for real-time person search
Ye Li
Kangning Yin
Jie Liang
Zhuofu Tan
Xinzhong Wang
Guangqiang Yin
Zhiguo Wang
Multimedia Systems, 2023, 29 : 211 - 222
[34] SPEECH RECOGNIZER OPTIMIZATION AND REAL-TIME IMPLEMENTATION ON A MULTITRANSPUTER ARRAY
CARAZO, J
ALEXANDRES, S
MORAN, J
MICROPROCESSING AND MICROPROGRAMMING, 1992, 34 (1-5): : 219 - 222
[35] Minimum Bandwidth Reservations for Periodic Streams in Wireless Real-Time Systems
Yi, Jun
Poellabauer, Christian
Hu, Xiaobo Sharon
Zhang, Liqiang
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2011, 10 (04) : 479 - 490
[36] A Low Computation Cost Model for Real-Time Speech Enhancement
Wang, Qirui
Zhou, Lin
Cao, Yanxiang
Zhuang, Chenghao
Cheng, Yunling
Deng, Yuxi
2024 13TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS, ICCCAS 2024, 2024, : 267 - 271
[37] ONLINE REPET-SIM FOR REAL-TIME SPEECH ENHANCEMENT
Rafii, Zafar
Pardo, Bryan
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 848 - 852
[38] Noise Suppression Method With Low-Complexity Noise Estimation Model and Heuristic Noise-Masking Algorithm for Real-Time Processing of Robot Vacuum Cleaners
Shin, Seunghyeon
Kim, Minhan
Jeon, Inkoo
Song, Ju-Man
Park, Yongjin
Son, Jungkwan
Lee, Seokjin
IEEE ACCESS, 2025, 13 : 789 - 801
[39] Real-Time Codebook-based Speech Enhancement with GPUs
Prasanna, A. N. Sai
Gurumurthyt, Iver Chandrashekaran
Naidu, D. H. R.
Baruith, Pallav Kuniar
2014 INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2014, : 306 - 311
[40] A Linear Programming Approach to Joint Scheduling of Real-Time and Non Real-Time Services in OFDMA-based Systems
Boujelben, Yassine
Ghandri, Abdennaceur
Mnif, Kais
2017 13TH INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING CONFERENCE (IWCMC), 2017, : 1268 - 1273

← 1 2 3 4 5 →