Screening Voice Disorders: Acoustic Voice Quality Index, Cepstral Peak Prominence, and Machine Learning

被引:0
|
作者
Yousef, Ahmed M. [1 ,2 ,3 ]
Castillo-Allendes, Adrian [3 ,4 ]
Berardi, Mark L. [3 ]
Codino, Juliana [5 ]
Rubin, Adam D. [5 ]
Hunter, Eric J. [3 ]
机构
[1] Massachusetts Gen Hosp, Ctr Laryngeal Surg & Voice Rehabil, Boston, MA 02114 USA
[2] Harvard Med Sch, Dept Surg, Boston, MA 02115 USA
[3] Univ Iowa, Dept Commun Sci & Disorders, Iowa City, IA 52242 USA
[4] Michigan State Univ, Dept Commun Sci & Disorders, E Lansing, MI USA
[5] Lakeshore Profess Voice Ctr, Lakeshore Ear Nose & Throat Ctr, St Clair Shores, MI USA
基金
美国国家卫生研究院;
关键词
Voice disorders; Machine learning; Speech acoustics; Acoustic Voice Quality Index; Cepstral Peak Prominence; DYSPHONIA SEVERITY; SPEECH; VALIDATION; PITCH;
D O I
10.1159/000544852
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Introduction: The Acoustic Voice Quality Index (AVQI) and Smoothed Cepstral Peak Prominence (CPPs) have been reported to effectively support the assessment of voice quality in persons seeking voice care across many languages. This study aimed to evaluate the diagnostic accuracy of these two measures in detecting voice disorders in American English speakers, comparing their performance to machine learning (ML) models. Methods: This retrospective study included a cohort of 187 participants: 138 patients with clinically diagnosed voice disorders and 49 vocally healthy individuals. Each participant completed two voicing tasks: sustaining [a:] vowel and producing a running speech sample, which were then concatenated. These samples were analyzed using VOXplot software for AVQI-3 (version 03.01) and CPPs. Additionally, four ML models (random forest, k-nearest neighbors, support vector machine, and decision tree) were trained for comparison. The diagnostic accuracy of the two measures and models was assessed using various evaluation metrics, including receiver operating characteristic curve and Youden Index. Results: A cutoff score of 1.54 for the AVQI-3 (with 55% sensitivity and 80% specificity) and 14.35 dB for CPPs (with 65% sensitivity and 78% specificity) were identified for detecting voice disorders. Compared to an average ML sensitivity of 89% and specificity of 55%, CPPs offered a better balance between sensitivity and specificity, outperforming AVQI-3 and nearly matching the average ML performance. Conclusions: ML shows great potential for supporting voice disorder diagnostics, especially as models become more generalizable and easier to interpret. However, current tools like AVQI-3 and CPPs remain more practical and accessible for clinical use in evaluating voice quality than commonly implemented models. CPPs, in particular, offers distinct advantages for identifying voice disorders, making it a recommended and feasible choice for clinics with limited resources.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Objective Dysphonia Measures in the Program Praat: Smoothed Cepstral Peak Prominence and Acoustic Voice Quality Index
    Maryn, Youri
    Weenink, David
    JOURNAL OF VOICE, 2015, 29 (01) : 35 - 43
  • [2] Cepstral Peak Prominence Values for Clinical Voice Evaluation
    Murton, Olivia
    Hillman, Robert
    Mehta, Daryush
    AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY, 2020, 29 (03) : 1596 - 1607
  • [3] The Minimal Important Difference of Acoustic Voice Quality Index in the Treatment of Voice Disorders
    Hosokawa, Kiyohito
    Iwahashi, Toshihiko
    Iwahashi, Mio
    Iwaki, Shinobu
    Yoshida, Misao
    Kitayama, Itsuki
    Miyauchi, Akira
    Ogawa, Makoto
    Inohara, Hidenori
    LARYNGOSCOPE, 2024, 134 (06) : 2805 - 2811
  • [4] Accuracy of Acoustic Voice Quality Index and Its Isolated Acoustic Measures to Discriminate the Severity of Voice Disorders
    Englert, Marina
    Lopes, Leonardo
    Vieira, Vinicius
    Behlau, Mara
    JOURNAL OF VOICE, 2022, 36 (04) : 582.e1 - 582.e10
  • [5] The Acoustic Voice Quality Index, Version 03.01, in French and the Voice Handicap Index
    Pommee, Timothy
    Maryn, Youri
    Finck, Camille
    Morsomme, Dominique
    JOURNAL OF VOICE, 2020, 34 (04) : 646.e1 - 646.e10
  • [6] The cepstral spectral index of dysphonia, the acoustic voice quality index and the acoustic breathiness index as novel multiparametric indices for acoustic assessment of voice quality
    Barsties V. Latoszek, Ben
    Mathmann, Philipp
    Neumann, Katrin
    CURRENT OPINION IN OTOLARYNGOLOGY & HEAD AND NECK SURGERY, 2021, 29 (06) : 451 - 457
  • [7] Comparison of Two Multiparameter Acoustic Indices of Dysphonia Severity: The Acoustic Voice Quality Index and Cepstral Spectral Index of Dysphonia
    Lee, Jeong Min
    Roy, Nelson
    Peterson, Elizabeth
    Merrill, Ray M.
    JOURNAL OF VOICE, 2018, 32 (04) : 515.e1 - 515.e13
  • [8] Objective Assessment of Pediatric Voice Disorders With the Acoustic Voice Quality Index
    Reynolds, Victoria
    Buckland, Ali
    Bailey, Jean
    Lipscombe, Jodi
    Nathan, Elizabeth
    Vijayasekaran, Shyan
    Kelly, Rona
    Maryn, Youri
    French, Noel
    JOURNAL OF VOICE, 2012, 26 (05) : 672.e1 - 672.e7
  • [9] Exploring the feasibility of the combination of acoustic voice quality index and glottal function index for voice pathology screening
    Ulozaite-Staniene, Nora
    Petrauskas, Tadas
    Saferis, Viktoras
    Uloza, Virgilijus
    EUROPEAN ARCHIVES OF OTO-RHINO-LARYNGOLOGY, 2019, 276 (06) : 1737 - 1745
  • [10] An iOS-based Cepstral Peak Prominence Application: Feasibility for Patient Practice of Resonant Voice
    van Leer, Eva
    Pfister, Robert C.
    Zhou, Xuefu
    JOURNAL OF VOICE, 2017, 31 (01) : 131.e9 - 131.e16