End-to-End Learning for Physics-Based Acoustic Modeling

Cited by: 6
Authors
Gabrielli, Leonardo [1]
Tomassetti, Stefano [1]
Zinato, Carlo [2]
Piazza, Francesco [1]
Affiliations
[1] Univ Politecn Marche, Dept Informat Engn, I-60121 Ancona, Italy
[2] Viscount Int SpA, I-47836 Mondaino, Italy
Source
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2018 / Vol. 2 / No. 2
Keywords
Physics-based acoustic modeling; end-to-end learning; convolutional neural networks; SOUND; ALGORITHM
DOI
10.1109/TETCI.2017.2787125
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In past years, physics-based acoustic modeling developed theoretically to the point of yielding accurate understanding and description of a large number of acoustic phenomena, such as those involved in sound generation. Numerical algorithms have been proposed that are able to simulate these phenomena in real time with an acceptable computational cost, indeed reaching the market with commercial products. Sound synthesis based on physical models could benefit greatly from automated methods that require less specific know-how and save the sound-designer valuable time. This paper introduces a novel approach to parameter estimation in physics-based sound synthesis that is general and obtains good results based on an end-to-end computational intelligence paradigm. The approach is presented in a formal way and application to a practical use case is reported. Methodological issues, such as dataset generation, are investigated.
Pages: 160-170
Page count: 11
Related papers
50 in total
  • [1] LEARNING FILTERBANKS FOR END-TO-END ACOUSTIC BEAMFORMING
    Cornell, Samuele
    Pariente, Manuel
    Grondin, Francois
    Squartini, Stefano
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6507 - 6511
  • [2] End-to-end analysis modeling of vibrational spectroscopy based on deep learning approach
    Wang, Xin
    Yu, Long
    Tian, Shengwei
    Lv, Xiaoyi
    Meng, Xin
    Zhang, Wendong
    JOURNAL OF CHEMOMETRICS, 2020, 34 (10)
  • [3] END-TO-END LEARNING FOR MUSIC AUDIO
    Dieleman, Sander
    Schrauwen, Benjamin
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [4] End-to-End Learning Based on Autoencoder for Fronthaul
    Nie, Junyuan
    Zhang, Jing
    Jiang, Wenshan
    Qiu, Kun
    Dai, Xiaoxiao
    Yang, Qi
    2022 ASIA COMMUNICATIONS AND PHOTONICS CONFERENCE, ACP, 2022, : 953 - 956
  • [5] An End-to-End Deep Learning Approach to Simultaneous Speech Dereverberation and Acoustic Modeling for Robust Speech Recognition
    Wu, Bo
    Li, Kehuang
    Ge, Fengpei
    Huang, Zhen
    Yang, Minglei
    Siniscalchi, Sabato Marco
    Lee, Chin-Hui
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2017, 11 (08) : 1289 - 1300
  • [6] Maximum-a-Posteriori-Based Decoding for End-to-End Acoustic Models
    Kanda, Naoyuki
    Lu, Xugang
    Kawai, Hisashi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (05) : 1023 - 1034
  • [7] End-to-End Learning-Based Image Compression: A Review
    Chen Jimin
    Lin Zehao
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (22)
  • [8] Amharic OCR: An End-to-End Learning
    Belay, Birhanu
    Habtegebrial, Tewodros
    Meshesha, Million
    Liwicki, Marcus
    Belay, Gebeyehu
    Stricker, Didier
    APPLIED SCIENCES-BASEL, 2020, 10 (03):
  • [9] Impact of Aliasing on Deep CNN-Based End-to-End Acoustic Models
    Gong, Yuan
    Poellabauer, Christian
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2698 - 2702
  • [10] Incremental End-to-End Learning for Lateral Control in Autonomous Driving
    Kwon, Jaerock
    Khalil, Aws
    Kim, Donghyun
    Nam, Haewoon
    IEEE ACCESS, 2022, 10 : 33771 - 33786