Deep Learning for Black-Box Modeling of Audio Effects

被引:22
|
作者
Ramirez, Marco A. Martinez [1 ]
Benetos, Emmanouil [1 ]
Reiss, Joshua D. [1 ]
机构
[1] Queen Mary Univ London, Ctr Digital Mus, Mile End Rd, London E1 4NS, England
来源
APPLIED SCIENCES-BASEL | 2020年 / 10卷 / 02期
基金
英国工程与自然科学研究理事会;
关键词
black-box modeling; nonlinear; time-varying; audio effects; deep learning; tube amplifier; transistor-based limiter; Leslie speaker;
D O I
10.3390/app10020638
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Virtual analog modeling of audio effects consists of emulating the sound of an audio processor reference device. This digital simulation is normally done by designing mathematical models of these systems. It is often difficult because it seeks to accurately model all components within the effect unit, which usually contains various nonlinearities and time-varying components. Most existing methods for audio effects modeling are either simplified or optimized to a very specific circuit or type of audio effect and cannot be efficiently translated to other types of audio effects. Recently, deep neural networks have been explored as black-box modeling strategies to solve this task, i.e., by using only input-output measurements. We analyse different state-of-the-art deep learning models based on convolutional and recurrent neural networks, feedforward WaveNet architectures and we also introduce a new model based on the combination of the aforementioned models. Through objective perceptual-based metrics and subjective listening tests we explore the performance of these models when modeling various analog audio effects. Thus, we show virtual analog models of nonlinear effects, such as a tube preamplifier; nonlinear effects with memory, such as a transistor-based limiter and nonlinear time-varying effects, such as the rotating horn and rotating woofer of a Leslie speaker cabinet.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] Fuzzy Modeling from Black-Box Data with Deep Learning Techniques
    de la Rosa, Erick
    Yu, Wen
    Sossa, Humberto
    ADVANCES IN NEURAL NETWORKS, PT I, 2017, 10261 : 304 - 312
  • [2] DIFFERENTIABLE SIGNAL PROCESSING WITH BLACK-BOX AUDIO EFFECTS
    Ramirez, Marco A. Martinez
    Wang, Oliver
    Smaragdis, Paris
    Bryan, Nicholas J.
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 66 - 70
  • [3] A Deep Learning Based Macro Circuit Modeling for Black-box EMC Problems
    Jiang, Yang
    Gao, Richard Xian-Ke
    2021 JOINT IEEE INTERNATIONAL SYMPOSIUM ON ELECTROMAGNETIC COMPATIBILITY, SIGNAL & POWER INTEGRITY, AND EMC EUROPE (EMC+SIPI AND EMC EUROPE), 2021, : 64 - 67
  • [4] Understanding the black-box: towards interpretable and reliable deep learning models
    Qamar, Tehreem
    Bawany, Narmeen Zakaria
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [5] Automated Deep Learning BLACK-BOX Attack for Multimedia P-BOX Security Assessment
    Tolba, Zakaria
    Derdour, Makhlouf
    Ferrag, Mohamed Amine
    Muyeen, S. M.
    Benbouzid, Mohamed
    IEEE ACCESS, 2022, 10 : 94019 - 94039
  • [6] Unbox the Black-Box: Predict and Interpret YouTube Viewership Using Deep Learning
    Xie, Jiaheng
    Chai, Yidong
    Liu, Xiao
    JOURNAL OF MANAGEMENT INFORMATION SYSTEMS, 2023, 40 (02) : 541 - 579
  • [7] Black-box modeling of a rapid sand filter
    van Ginneken, HLH
    Babuska, R
    Groennou, JT
    Kappelhof, JWNM
    Verbruggen, HB
    ARTIFICIAL INTELLIGENCE IN REAL-TIME CONTROL 1998, 1999, : 101 - 106
  • [8] Black-Box Audio Adversarial Example Generation Using Variational Autoencoder
    Zong, Wei
    Chow, Yang-Wai
    Susilo, Willy
    INFORMATION AND COMMUNICATIONS SECURITY (ICICS 2021), PT II, 2021, 12919 : 142 - 160
  • [9] Rearranging Pixels is a Powerful Black-Box Attack for RGB and Infrared Deep Learning Models
    Pomponi, Jary
    Dantoni, Daniele
    Alessandro, Nicolosi
    Scardapane, Simone
    IEEE ACCESS, 2023, 11 : 11298 - 11306
  • [10] Black-Box Reward Attacks Against Deep Reinforcement Learning Based on Successor Representation
    Cai, Kanting
    Zhu, Xiangbin
    Hu, Zhao-Long
    IEEE ACCESS, 2022, 10 : 51548 - 51560