Estimating vocal tract geometry from acoustic impedance using deep neural network

被引:0
作者
Balamurali, B. T. [1 ]
Kapoor, Saumitra [1 ]
Chen, Jer-Ming [1 ]
机构
[1] Singapore Univ Technol & Design, Singapore, Singapore
来源
JASA EXPRESS LETTERS | 2022年 / 2卷 / 03期
关键词
AREA; RESONANCE;
D O I
10.1121/10.0009599
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A data-driven approach using artificial neural networks is proposed to address the classic inverse area function problem, i.e., to determine the vocal tract geometry (modelled as a tube of nonuniform cylindrical cross-sections) from the vocal tract acoustic impedance spectrum. The predicted cylindrical radii and the actual radii were found to have high correlation in the three- and four-cylinder model (Pearson coefficient (rho) and Lin concordance coefficient (rho(c)) exceeded 95%); however, for the six-cylinder model, the correlation was low (rho around 75% and rho(c) around 69%). Upon standardizing the impedance value, the correlation improved significantly for all cases (rho and rho(c) exceeded 90%). (C) 2022 Author(s).
引用
收藏
页数:8
相关论文
共 35 条
[1]   Solving ill-posed inverse problems using iterative deep neural networks [J].
Adler, Jonas ;
Oktem, Ozan .
INVERSE PROBLEMS, 2017, 33 (12)
[2]  
[Anonymous], 2013, P SPEECH PRODUCTION
[3]   Artificial neural networks: fundamentals, computing, design, and application [J].
Basheer, IA ;
Hajmeer, M .
JOURNAL OF MICROBIOLOGICAL METHODS, 2000, 43 (01) :3-31
[4]  
Chollet F., 2015, KERAS
[5]  
Dusan S., 1998, INT C SPOKEN LANGUAG
[6]   Improvement in Neural Respiratory Drive Estimation From Diaphragm Electromyographic Signals Using Fixed Sample Entropy [J].
Estrada, Luis ;
Torres, Abel ;
Sarlabous, Leonardo ;
Jane, Raimon .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2016, 20 (02) :476-485
[7]  
Fant G, 1971, AcousticTheory of Speech Production: With Calculations Based on X-Ray Studies of Russian Articulations (no. 2), DOI DOI 10.1515/9783110873429
[8]  
Fletcher N.H., 2012, The Physics of Musical Instruments
[9]  
Gant V., 2001, Clinical Applications of Artificial Neural Networks
[10]   Vocal tract adjustments in the high soprano range [J].
Garnier, Maeva ;
Henrich, Nathalie ;
Smith, John ;
Wolfe, Joe .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 127 (06) :3771-3780