Synthesis of voiced sounds using physics-informed neural networks

被引:0
作者
Yokota, Kazuya [1 ]
Ogura, Masataka [2 ]
Abe, Masajiro [3 ]
机构
[1] Nagaoka Univ Technol, Dept Mech Engn, 1603-1 Kamitomioka, Nagaoka 9402188, Japan
[2] Nagaoka Univ Technol, Ctr Integrated Technol Support, 1603-1 Kamitomioka, Nagaoka 9402188, Japan
[3] Nagaoka Univ Technol, Dept Syst Safety Engn, 1603-1 Kamitomioka, Nagaoka 9402188, Japan
关键词
Physics-informed neural networks; PINNs; Vocal tract; Voiced sounds; Glottal inverse filtering; MODEL;
D O I
10.1250/ast.e24.55
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recently, physics-informed neural networks (PINNs) have garnered attention for use as a numerical simulation method for inverse analysis, such as property identification. However, studies on PINNs for conducting acoustic analysis are scarce. Thus, this study developed PINNs that performed acoustic analysis of the vocal tract and synthesized voiced sounds. In addition, PINNs were used to identify glottal source waveforms. Consequently, PINNs were demonstrated to be a promising solution for the inverse problem related to speech production.
引用
收藏
页码:333 / 336
页数:4
相关论文
共 12 条
[1]   Education system in acoustics of speech production using physical models of the human vocal tract [J].
Arai, Takayuki .
ACOUSTICAL SCIENCE AND TECHNOLOGY, 2007, 28 (03) :190-201
[2]   Physics-Informed Neural Networks for Heat Transfer Problems [J].
Cai, Shengze ;
Wang, Zhicheng ;
Wang, Sifan ;
Perdikaris, Paris ;
Karniadakis, George E. M. .
JOURNAL OF HEAT TRANSFER-TRANSACTIONS OF THE ASME, 2021, 143 (06)
[3]  
Flanagan J. L., 2013, Speech analysis synthesis and perception, V3
[4]   SYNTHESIS OF VOICED SOUNDS FROM A 2-MASS MODEL OF VOCAL CORDS [J].
ISHIZAKA, K ;
FLANAGAN, JL .
BELL SYSTEM TECHNICAL JOURNAL, 1972, 51 (06) :1233-+
[5]   Static measurements of vowel formant frequencies and bandwidths: A review [J].
Kent, Raymond D. ;
Vorperian, Houri K. .
JOURNAL OF COMMUNICATION DISORDERS, 2018, 74 :74-97
[6]   Physics-informed neural networks for high-speed flows [J].
Mao, Zhiping ;
Jagtap, Ameya D. ;
Karniadakis, George Em .
COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2020, 360
[7]  
Moseley B, 2020, Arxiv, DOI [arXiv:2006.11894, DOI 10.48550/ARXIV.2006.11894]
[8]   Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations [J].
Raissi, M. ;
Perdikaris, P. ;
Karniadakis, G. E. .
JOURNAL OF COMPUTATIONAL PHYSICS, 2019, 378 :686-707
[9]   EFFECT OF GLOTTAL PULSE SHAPE ON QUALITY OF NATURAL VOWELS [J].
ROSENBERG, AE .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1971, 49 (02) :583-+
[10]   DIRECT ESTIMATION OF VOCAL-TRACT SHAPE BY INVERSE FILTERING OF ACOUSTIC SPEECH WAVEFORMS [J].
WAKITA, H .
IEEE TRANSACTIONS ON AUDIO AND ELECTROACOUSTICS, 1973, AU21 (05) :417-427