REAL-TIME JOINT NOISE SUPPRESSION AND BANDWIDTH EXTENSION OF NOISY REVERBERANT WIDEBAND SPEECH

被引:0
|
作者
Gomez, Esteban [1 ,2 ]
Backstrom, Tom [1 ]
机构
[1] Aalto Univ, Dept Informat & Commun Engn, Espoo, Finland
[2] Voicemod Inc, Valencia, Spain
来源
2024 18TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT, IWAENC 2024 | 2024年
关键词
Bandwidth extension; noise suppression; real-time; deep learning; multitasking; PERCEPTION;
D O I
10.1109/IWAENC61483.2024.10694458
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Artificially extending the bandwidth of speech in real-time applications that are band-limited to 16 kHz (known as wide-band) or lower sample rates such as VoIP or communication over Bluetooth, can significantly improve its perceptual quality. Typically, dry clean speech is assumed as input to estimate the missing spectral information. However, such an assumption falls short if the input speech is reverberant or has been contaminated by noise, resulting in audible artifacts. We propose a real-time low-complexity multitasking neural network capable of performing noise suppression and bandwidth extension from 16 kHz to 48 kHz (fullband) on a CPU, preventing such issues even if the noise cannot be completely removed from the input. Instead of employing a monolithic model, we adopt a modular approach and complexity reduction methods that result in a more compact model than the sum of its parts while improving its performance.
引用
收藏
页码:6 / 10
页数:5
相关论文
共 50 条
  • [41] Enhancing the usability of real-time speech recognition captioning through personalised displays and real-time multiple speaker editing and annotation
    Wald, Mike
    Bain, Keith
    UNIVERSAL ACCESS IN HUMAN-COMPUTER INTERACTION: APPLICATIONS AND SERVICES, PT 3, PROCEEDINGS, 2007, : 446 - +
  • [42] Temporally Stable Real-Time Joint Neural Denoising and Supersampling
    Thomas, Manu Mathew
    Liktor, Gabor
    Peters, Christoph
    Kim, Sungye
    Vaidyanathan, Karthik
    Forbes, Angus G.
    PROCEEDINGS OF THE ACM ON COMPUTER GRAPHICS AND INTERACTIVE TECHNIQUES, 2022, 5 (03)
  • [43] Joint Rate Control and Scheduling for Real-Time Wireless Networks
    Zuo, Shuai
    Hou, I-Hong
    Liu, Tie
    Swami, Ananthram
    Basu, Prithwish
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2017, 16 (07) : 4562 - 4570
  • [44] A real-time machine learning application for browser extension security monitoring
    Fowdur, Tulsi Pawan
    Hosenally, Shuaib
    INFORMATION SECURITY JOURNAL, 2024, 33 (01): : 16 - 41
  • [45] Real-Time ROS Extension on Transparent CPU/GPU Coordination Mechanism
    Suzuki, Yuhei
    Azumi, Takuya
    Kato, Shinpei
    Nishio, Nobuhiko
    2018 IEEE 21ST INTERNATIONAL SYMPOSIUM ON REAL-TIME DISTRIBUTED COMPUTING (ISORC 2018), 2018, : 184 - 192
  • [46] A Real-time Scheduling Scheme of Quantized Control Systems under Bandwidth Constraints
    Liu, Guiyun
    Xu, Bugong
    2010 8TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2010, : 2334 - 2338
  • [47] Joint Detection and Active Cancellation of Snoring Signals in Real-Time
    Serafini, Luca
    Bruschi, Valeria
    Nobilit, Stefano
    Principi, Emanuele
    Cecchi, Stefania
    Squartini, Stefano
    2023 4TH INTERNATIONAL SYMPOSIUM ON THE INTERNET OF SOUNDS, 2023, : 295 - 303
  • [48] A Real-Time Dual-Microphone Speech Enhancement Algorithm Assisted by Bone Conduction Sensor
    Zhou, Yi
    Chen, Yufan
    Ma, Yongbao
    Liu, Hongqing
    SENSORS, 2020, 20 (18) : 1 - 17
  • [49] Real-Time Implementation of Cochlear Implant Speech Processing Pipeline on Smartphones
    Parris, Shane
    Torlak, Murat
    Kehtarnavaz, Nasser
    2014 36TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2014, : 886 - 889
  • [50] Learning Continuous Facial Actions From Speech for Real-Time Animation
    Pham, Hai X.
    Wang, Yuting
    Pavlovic, Vladimir
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2022, 13 (03) : 1567 - 1580