Machine learning based identification of relevant parameters for functional voice disorders derived from endoscopic high-speed recordings

被引:28
作者
Schlegel, Patrick [1 ]
Kniesburges, Stefan [1 ]
Duerr, Stephan [1 ]
Schuetzenberger, Anne [1 ]
Doellinger, Michael [1 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg, Univ Hosp Erlangen, Dept Otorhinolaryngol, Div Phoniatr & Pediat Audiol, Erlangen, Germany
关键词
VOCAL FOLD VIBRATIONS; TO-NOISE RATIO; SPATIOTEMPORAL ANALYSIS; DYNAMICS; CLASSIFICATION; RECONSTRUCTION; HEALTHY; VARIABILITY; DYSPHONIA; FEATURES;
D O I
10.1038/s41598-020-66405-y
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In voice research and clinical assessment, many objective parameters are in use. However, there is no commonly used set of parameters that reflect certain voice disorders, such as functional dysphonia (FD); i.e. disorders with no visible anatomical changes. Hence, 358 high-speed videoendoscopy (HSV) recordings (159 normal females (N-F), 101 FD females (FDF), 66 normal males (N-M), 32 FD males (FDM)) were analyzed. We investigated 91 quantitative HSV parameters towards their significance. First, 25 highly correlated parameters were discarded. Second, further 54 parameters were discarded by using a LogitBoost decision stumps approach. This yielded a subset of 12 parameters sufficient to reflect functional dysphonia. These parameters separated groups N-F vs. FDF and N-M vs. FDM with fair accuracy of 0.745 or 0.768, respectively. Parameters solely computed from the changing glottal area waveform (1D-function called GAW) between the vocal folds were less important than parameters describing the oscillation characteristics along the vocal folds (2D-function called Phonovibrogram). Regularity of GAW phases and peak shape, harmonic structure and Phonovibrogram-based vocal fold open and closing angles were mainly important. This study showed the high degree of redundancy of HSV-voice-parameters but also affirms the need of multidimensional based assessment of clinical data.
引用
收藏
页数:14
相关论文
共 67 条
[1]  
[Anonymous], 3 EUR C SPEECH COMM
[3]   Acoustic prediction of voice type in women with functional dysphonia [J].
Awan, SN ;
Roy, N .
JOURNAL OF VOICE, 2005, 19 (02) :268-282
[4]   Automated setup for ex vivo larynx experiments [J].
Birk, Veronika ;
Doellinger, Michael ;
Sutor, Alexander ;
Berry, David A. ;
Gedeon, Dominik ;
Traxdorf, Maximilian ;
Wendler, Olaf ;
Bohr, Christopher ;
Kniesburges, Stefan .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 141 (03) :1349-1359
[5]   Spatiotemporal Analysis of High-Speed Videolaryngoscopic Imaging of Organic Pathologies in Males [J].
Bohr, Christopher ;
Kraeck, Angelika ;
Dubrovskiy, Denis ;
Eysholdt, Ulrich ;
Svec, Jan ;
Psychogios, Georgios ;
Ziethe, Anke ;
Doellinger, Michael .
JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2014, 57 (04) :1148-1161
[6]   Vocal Fold Phase Asymmetries in Patients With Voice Disorders: A Study Across Visualization Techniques [J].
Bonilha, Heather Shaw ;
Deliyski, Dimitar D. ;
Whiteside, Joanna Piasecki ;
Gerlach, Terri Treman .
AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY, 2012, 21 (01) :3-15
[7]   Self-organizing map for the classification of normal and disordered female voices [J].
Callan, DE ;
Kent, RD ;
Roy, N ;
Tasko, SM .
JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 1999, 42 (02) :355-366
[8]  
Caruana R., 2006, PAPER PRESENTED AT T
[9]   Development of a glottal area index that integrates glottal gap size and open quotient [J].
Chen, Gang ;
Kreiman, Jody ;
Gerratt, Bruce R. ;
Neubauer, Juergen ;
Shue, Yen-Liang ;
Alwan, Abeer .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 133 (03) :1656-1666
[10]   Hierarchical Classification and System Combination for Automatically Identifying Physiological and Neuromuscular Laryngeal Pathologies [J].
Cordeiro, Hugo ;
Fonseca, Jose ;
Guimaraes, Isabel ;
Meneses, Carlos .
JOURNAL OF VOICE, 2017, 31 (03) :384.e9-384.e14