Machine learning based identification of relevant parameters for functional voice disorders derived from endoscopic high-speed recordings

被引:27
作者
Schlegel, Patrick [1 ]
Kniesburges, Stefan [1 ]
Duerr, Stephan [1 ]
Schuetzenberger, Anne [1 ]
Doellinger, Michael [1 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg, Univ Hosp Erlangen, Dept Otorhinolaryngol, Div Phoniatr & Pediat Audiol, Erlangen, Germany
关键词
VOCAL FOLD VIBRATIONS; TO-NOISE RATIO; SPATIOTEMPORAL ANALYSIS; DYNAMICS; CLASSIFICATION; RECONSTRUCTION; HEALTHY; VARIABILITY; DYSPHONIA; FEATURES;
D O I
10.1038/s41598-020-66405-y
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In voice research and clinical assessment, many objective parameters are in use. However, there is no commonly used set of parameters that reflect certain voice disorders, such as functional dysphonia (FD); i.e. disorders with no visible anatomical changes. Hence, 358 high-speed videoendoscopy (HSV) recordings (159 normal females (N-F), 101 FD females (FDF), 66 normal males (N-M), 32 FD males (FDM)) were analyzed. We investigated 91 quantitative HSV parameters towards their significance. First, 25 highly correlated parameters were discarded. Second, further 54 parameters were discarded by using a LogitBoost decision stumps approach. This yielded a subset of 12 parameters sufficient to reflect functional dysphonia. These parameters separated groups N-F vs. FDF and N-M vs. FDM with fair accuracy of 0.745 or 0.768, respectively. Parameters solely computed from the changing glottal area waveform (1D-function called GAW) between the vocal folds were less important than parameters describing the oscillation characteristics along the vocal folds (2D-function called Phonovibrogram). Regularity of GAW phases and peak shape, harmonic structure and Phonovibrogram-based vocal fold open and closing angles were mainly important. This study showed the high degree of redundancy of HSV-voice-parameters but also affirms the need of multidimensional based assessment of clinical data.
引用
收藏
页数:14
相关论文
共 67 条
  • [1] [Anonymous], 2006, PAPER PRESENTED AT T
  • [2] [Anonymous], 1973, Studia phonologica
  • [3] [Anonymous], 3 EUR C SPEECH COMM
  • [5] Acoustic prediction of voice type in women with functional dysphonia
    Awan, SN
    Roy, N
    [J]. JOURNAL OF VOICE, 2005, 19 (02) : 268 - 282
  • [6] Automated setup for ex vivo larynx experiments
    Birk, Veronika
    Doellinger, Michael
    Sutor, Alexander
    Berry, David A.
    Gedeon, Dominik
    Traxdorf, Maximilian
    Wendler, Olaf
    Bohr, Christopher
    Kniesburges, Stefan
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 141 (03) : 1349 - 1359
  • [7] Spatiotemporal Analysis of High-Speed Videolaryngoscopic Imaging of Organic Pathologies in Males
    Bohr, Christopher
    Kraeck, Angelika
    Dubrovskiy, Denis
    Eysholdt, Ulrich
    Svec, Jan
    Psychogios, Georgios
    Ziethe, Anke
    Doellinger, Michael
    [J]. JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2014, 57 (04): : 1148 - 1161
  • [8] Vocal Fold Phase Asymmetries in Patients With Voice Disorders: A Study Across Visualization Techniques
    Bonilha, Heather Shaw
    Deliyski, Dimitar D.
    Whiteside, Joanna Piasecki
    Gerlach, Terri Treman
    [J]. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY, 2012, 21 (01) : 3 - 15
  • [9] Self-organizing map for the classification of normal and disordered female voices
    Callan, DE
    Kent, RD
    Roy, N
    Tasko, SM
    [J]. JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 1999, 42 (02): : 355 - 366
  • [10] Development of a glottal area index that integrates glottal gap size and open quotient
    Chen, Gang
    Kreiman, Jody
    Gerratt, Bruce R.
    Neubauer, Juergen
    Shue, Yen-Liang
    Alwan, Abeer
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 133 (03) : 1656 - 1666