A stochastic model of voice generation and the corresponding solution for the inverse problem using Artificial Neural Network for case with pathology in the vocal folds

被引:3
作者
Cataldo, E. [1 ]
Soize, C. [2 ]
机构
[1] Univ Fed Fluminense, Grad Program Elect & Telecommun Engn, Rua Mario Santos Braga S-N, BR-24020140 Niteroi, RJ, Brazil
[2] Univ Gustave Eiffel, Lab Modelisat & Simulat Multi Echelle, MSME UMR 8208, CNRS, 5 Bd Descartes, F-77454 Marne La Vallee, France
关键词
Voice production; Jitter; Stochastic biomechanical models; Voice pathologies; OSCILLATION; JITTER;
D O I
10.1016/j.bspc.2021.102623
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
A novel stochastic model to produce voiced sounds is proposed and, mainly, the corresponding identification of some model parameters using an Artificial Neural Network (ANN). The procedure described in this paper is about an intermediate step, which has as final objective to identify pathologies in the vocal folds through the voice of patients, that is, through a non-invasive method. The proposed model presented here uses the source-filter Fant theory and three main novelties are presented: a new mathematical model to produce voice obtained from the unification of two other deterministic one mass-spring-damper models obtained from the literature; a stochastic model that can generate and control the level of jitter resulting even in hoarse voice signals and/or with pathological characteristics but using a simpler model than those usually discussed in the literature; and the most important novelty, the identification of parameters of the proposed model, from experimental voice signals, using an ANN, particularly in a pathological case. The proposed neural network-based identification method requires a construction of a database from which an ANN can be trained to learn the nonlinear relationship between the parameters of the stochastic model and some relevant quantities of interest. The corresponding inverse stochastic problem is then solved in two cases: for one utterance corresponding to a normal voice and for another utterance corresponding to a pathological case corresponding to a nodulus in the vocal folds, helping to validate the model.
引用
收藏
页数:8
相关论文
共 24 条
  • [1] [Anonymous], 1997, Applied Smoothing Techniques for Data Analysis: The Kernel Approach with S-Plus Illustrations
  • [2] Stochastic mechanical model of vocal folds for producing jitter and for identifying pathologies through real voices
    Cataldo, E.
    Soize, C.
    [J]. JOURNAL OF BIOMECHANICS, 2018, 74 : 126 - 133
  • [3] Jitter generation in voice signals produced by a two-mass stochastic mechanical model
    Cataldo, E.
    Soize, C.
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2016, 27 : 87 - 95
  • [4] FANT G, 1963, ACOUSTIC THEORY SPEE
  • [5] Physical simulation of laryngeal disorders using a multiple-mass vocal fold model
    Fraile, Ruben
    Kob, Malte
    Godino-Llorente, Juan I.
    Saenz-Lechon, Nicolas
    Osma-Ruiz, Victor J.
    Gutierrez-Arriola, Juana M.
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2012, 7 (01) : 65 - 78
  • [6] Laje R, 2001, PHYS REV, V64
  • [7] Theoretical study of the hysteresis phenomenon at vocal fold oscillation onset-offset
    Lucero, JC
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 105 (01) : 423 - 431
  • [8] Phonation threshold pressure at large asymmetries of the vocal folds
    Lucero, Jorge C.
    Pelorson, Xavier
    Van Hirtum, Annemie
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2020, 62
  • [9] A lumped mucosal wave model of the vocal folds revisited: Recent extensions and oscillation hysteresis
    Lucero, Jorge C.
    Koenig, Laura L.
    Lourenco, Kelem G.
    Ruty, Nicolas
    Pelorson, Xavier
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2011, 129 (03) : 1568 - 1579
  • [10] A multipurpose user-friendly tool for voice analysis: Application to pathological adult voices
    Manfredi, Claudia
    Bocchi, Leonardo
    Cantarella, Giovanna
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2009, 4 (03) : 212 - 220