A stochastic model of voice generation and the corresponding solution for the inverse problem using Artificial Neural Network for case with pathology in the vocal folds

被引：3

作者：

Cataldo, E. ^{[1
]}

Soize, C. ^{[2
]}

机构：

[1] Univ Fed Fluminense, Grad Program Elect & Telecommun Engn, Rua Mario Santos Braga S-N, BR-24020140 Niteroi, RJ, Brazil

[2] Univ Gustave Eiffel, Lab Modelisat & Simulat Multi Echelle, MSME UMR 8208, CNRS, 5 Bd Descartes, F-77454 Marne La Vallee, France

来源：

BIOMEDICAL SIGNAL PROCESSING AND CONTROL | 2021年 / 68卷

关键词：

Voice production; Jitter; Stochastic biomechanical models; Voice pathologies; OSCILLATION; JITTER;

D O I：

10.1016/j.bspc.2021.102623

中图分类号：

R318 [生物医学工程];

学科分类号：

0831 ;

摘要：

A novel stochastic model to produce voiced sounds is proposed and, mainly, the corresponding identification of some model parameters using an Artificial Neural Network (ANN). The procedure described in this paper is about an intermediate step, which has as final objective to identify pathologies in the vocal folds through the voice of patients, that is, through a non-invasive method. The proposed model presented here uses the source-filter Fant theory and three main novelties are presented: a new mathematical model to produce voice obtained from the unification of two other deterministic one mass-spring-damper models obtained from the literature; a stochastic model that can generate and control the level of jitter resulting even in hoarse voice signals and/or with pathological characteristics but using a simpler model than those usually discussed in the literature; and the most important novelty, the identification of parameters of the proposed model, from experimental voice signals, using an ANN, particularly in a pathological case. The proposed neural network-based identification method requires a construction of a database from which an ANN can be trained to learn the nonlinear relationship between the parameters of the stochastic model and some relevant quantities of interest. The corresponding inverse stochastic problem is then solved in two cases: for one utterance corresponding to a normal voice and for another utterance corresponding to a pathological case corresponding to a nodulus in the vocal folds, helping to validate the model.

引用

页数：8

共 24 条

[1] [Anonymous], 1997, Applied Smoothing Techniques for Data Analysis: The Kernel Approach with S-Plus Illustrations
[2] Stochastic mechanical model of vocal folds for producing jitter and for identifying pathologies through real voices
Cataldo, E.
Soize, C.
[J]. JOURNAL OF BIOMECHANICS, 2018, 74 : 126 - 133
[3] Jitter generation in voice signals produced by a two-mass stochastic mechanical model
Cataldo, E.
Soize, C.
[J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2016, 27 : 87 - 95
[4] FANT G, 1963, ACOUSTIC THEORY SPEE
[5] Physical simulation of laryngeal disorders using a multiple-mass vocal fold model
Fraile, Ruben
Kob, Malte
Godino-Llorente, Juan I.
Saenz-Lechon, Nicolas
Osma-Ruiz, Victor J.
Gutierrez-Arriola, Juana M.
[J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2012, 7 (01) : 65 - 78
[6] Laje R, 2001, PHYS REV, V64
[7] Theoretical study of the hysteresis phenomenon at vocal fold oscillation onset-offset
Lucero, JC
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 105 (01) : 423 - 431
[8] Phonation threshold pressure at large asymmetries of the vocal folds
Lucero, Jorge C.
Pelorson, Xavier
Van Hirtum, Annemie
[J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2020, 62
[9] A lumped mucosal wave model of the vocal folds revisited: Recent extensions and oscillation hysteresis
Lucero, Jorge C.
Koenig, Laura L.
Lourenco, Kelem G.
Ruty, Nicolas
Pelorson, Xavier
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2011, 129 (03) : 1568 - 1579
[10] A multipurpose user-friendly tool for voice analysis: Application to pathological adult voices
Manfredi, Claudia
Bocchi, Leonardo
Cantarella, Giovanna
[J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2009, 4 (03) : 212 - 220

← 1 2 3 →