DANN: Digital Audio Neural Network

被引:1
作者
Baracskai, Zlatko [1 ]
机构
[1] Univ West England, Dept Comp Sci & Creat Technol, Bristol, Avon, England
来源
2019 4TH INTERNATIONAL CONFERENCE ON SMART AND SUSTAINABLE TECHNOLOGIES (SPLITECH) | 2019年
关键词
digital audio; neural networks; audio synthesis; audio processing;
D O I
10.23919/splitech.2019.8783027
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Neural networks have been widely used in computer audio to deal with many synthesis and processing parameters. Experiments have also led to direct synthesis of audio using dynamic neural networks that exhibit oscillating behaviour, as they are clocked at audio rates. There is clearly great potential in convolutional networks as they directly implement complex resonances enabling acoustic interaction. It thus transpires that many different neural network types were used for direct audio synthesis and processing. The proposed innovation extends a fully-connected network of computational cells to facilitate long feedback delays (needed for waveguides) intertwined with zero delays (needed for polynomials). Another improvement is the network computing algorithm that allows for randomisation of delays by adjusting the order of calculations. The DANN model is a specification for network of cells and links that allows direct translation from digital audio synthesis and processing schemes. Taking basic examples such as exponential decay, logistic chaos and biquad filter it transpires that there are exact DANN equivalents to these basic building blocks. It is therefore suggested that complex DANN schemes can reproduce myriad synthesis and processing techniques in a transparent fashion. As a matter of proving the concept a simple snare drum synthesizer is translated to a DANN scheme in order to explore the parameter space in this domain. Thanks to the large amount of training schemes there is great potential be found in training networks for velocity sensitive synthesis to include complex physical interaction sensors. Audio signal processing applications may include modelling of analogue equipment and symbolic acoustic instrument processing. The simplicity of the network grants easy implementation on fast architectures to facilitate near real-time processing and higher sample rates. Beyond the practical uses proposed it is further suggested that network simplification procedures may lead to yet unknown compact schemes for complex signal processing problems in the audio domain, potentially reverberation and pitch-shifting.
引用
收藏
页码:171 / 174
页数:4
相关论文
共 6 条
  • [1] Back A. D., 1991, NEURAL COMPUTATION, V3
  • [2] Lapedes A., 1987, IEEE INT C NEUR NETW
  • [3] Pearlmutter B., 1990, DYNAMIC RECURRENT NE
  • [4] Sontag E., 1992, P 7 YAL WORKSH AD LE
  • [5] Uncini A., 2003, NEUROCOMPUTING, V55
  • [6] van den Oord Aaron, 2016, Wavenet: A generative model for raw audio, P125, DOI 10.48550/arXiv.1609.03499