End-to-End Learning for Physics-Based Acoustic Modeling

被引：6

作者：

Gabrielli, Leonardo ^{[1
]}

Tomassetti, Stefano ^{[1
]}

Zinato, Carlo ^{[2
]}

Piazza, Francesco ^{[1
]}

机构：

[1] Univ Politecn Marche, Dept Informat Engn, I-60121 Ancona, Italy

[2] Viscount Int SpA, I-47836 Mondaino, Italy

来源：

IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2018年 / 2卷 / 02期

关键词：

Physics-based acoustic modeling; end-to-end learning; convolutional neural networks; SOUND; ALGORITHM;

D O I：

10.1109/TETCI.2017.2787125

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In past years, physics-based acoustic modeling developed theoretically to the point of yielding accurate understanding and description of a large number of acoustic phenomena, such as those involved in sound generation. Numerical algorithms have been proposed that are able to simulate these phenomena in real time with an acceptable computational cost, indeed reaching the market with commercial products. Sound synthesis based on physical models could benefit greatly from automated methods that require less specific know-how and save the sound-designer valuable time. This paper introduces a novel approach to parameter estimation in physics-based sound synthesis that is general and obtains good results based on an end-to-end computational intelligence paradigm. The approach is presented in a formal way and application to a practical use case is reported. Methodological issues, such as dataset generation, are investigated.

引用

页码：160 / 170

页数：11

共 50 条

[1] LEARNING FILTERBANKS FOR END-TO-END ACOUSTIC BEAMFORMING
Cornell, Samuele
Pariente, Manuel
Grondin, Francois
Squartini, Stefano
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6507 - 6511
[2] End-to-end analysis modeling of vibrational spectroscopy based on deep learning approach
Wang, Xin
Yu, Long
Tian, Shengwei
Lv, Xiaoyi
Meng, Xin
Zhang, Wendong
JOURNAL OF CHEMOMETRICS, 2020, 34 (10)
[3] END-TO-END LEARNING FOR MUSIC AUDIO
Dieleman, Sander
Schrauwen, Benjamin
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[4] End-to-End Learning Based on Autoencoder for Fronthaul
Nie, Junyuan
Zhang, Jing
Jiang, Wenshan
Qiu, Kun
Dai, Xiaoxiao
Yang, Qi
2022 ASIA COMMUNICATIONS AND PHOTONICS CONFERENCE, ACP, 2022, : 953 - 956
[5] An End-to-End Deep Learning Approach to Simultaneous Speech Dereverberation and Acoustic Modeling for Robust Speech Recognition
Wu, Bo
Li, Kehuang
Ge, Fengpei
Huang, Zhen
Yang, Minglei
Siniscalchi, Sabato Marco
Lee, Chin-Hui
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2017, 11 (08) : 1289 - 1300
[6] Maximum-a-Posteriori-Based Decoding for End-to-End Acoustic Models
Kanda, Naoyuki
Lu, Xugang
Kawai, Hisashi
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (05) : 1023 - 1034
[7] End-to-End Learning-Based Image Compression: A Review
Chen Jimin
Lin Zehao
LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (22)
[8] Amharic OCR: An End-to-End Learning
Belay, Birhanu
Habtegebrial, Tewodros
Meshesha, Million
Liwicki, Marcus
Belay, Gebeyehu
Stricker, Didier
APPLIED SCIENCES-BASEL, 2020, 10 (03):
[9] Impact of Aliasing on Deep CNN-Based End-to-End Acoustic Models
Gong, Yuan
Poellabauer, Christian
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2698 - 2702
[10] Incremental End-to-End Learning for Lateral Control in Autonomous Driving
Kwon, Jaerock
Khalil, Aws
Kim, Donghyun
Nam, Haewoon
IEEE ACCESS, 2022, 10 : 33771 - 33786

← 1 2 3 4 5 →