Synthesis of voiced sounds using physics-informed neural networks

被引:0
作者
Yokota, Kazuya [1 ]
Ogura, Masataka [2 ]
Abe, Masajiro [3 ]
机构
[1] Nagaoka Univ Technol, Dept Mech Engn, 1603-1 Kamitomioka, Nagaoka 9402188, Japan
[2] Nagaoka Univ Technol, Ctr Integrated Technol Support, 1603-1 Kamitomioka, Nagaoka 9402188, Japan
[3] Nagaoka Univ Technol, Dept Syst Safety Engn, 1603-1 Kamitomioka, Nagaoka 9402188, Japan
关键词
Physics-informed neural networks; PINNs; Vocal tract; Voiced sounds; Glottal inverse filtering; MODEL;
D O I
10.1250/ast.e24.55
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recently, physics-informed neural networks (PINNs) have garnered attention for use as a numerical simulation method for inverse analysis, such as property identification. However, studies on PINNs for conducting acoustic analysis are scarce. Thus, this study developed PINNs that performed acoustic analysis of the vocal tract and synthesized voiced sounds. In addition, PINNs were used to identify glottal source waveforms. Consequently, PINNs were demonstrated to be a promising solution for the inverse problem related to speech production.
引用
收藏
页码:333 / 336
页数:4
相关论文
共 12 条