Screening Voice Disorders: Acoustic Voice Quality Index, Cepstral Peak Prominence, and Machine Learning

被引:0
|
作者
Yousef, Ahmed M. [1 ,2 ,3 ]
Castillo-Allendes, Adrian [3 ,4 ]
Berardi, Mark L. [3 ]
Codino, Juliana [5 ]
Rubin, Adam D. [5 ]
Hunter, Eric J. [3 ]
机构
[1] Massachusetts Gen Hosp, Ctr Laryngeal Surg & Voice Rehabil, Boston, MA 02114 USA
[2] Harvard Med Sch, Dept Surg, Boston, MA 02115 USA
[3] Univ Iowa, Dept Commun Sci & Disorders, Iowa City, IA 52242 USA
[4] Michigan State Univ, Dept Commun Sci & Disorders, E Lansing, MI USA
[5] Lakeshore Profess Voice Ctr, Lakeshore Ear Nose & Throat Ctr, St Clair Shores, MI USA
基金
美国国家卫生研究院;
关键词
Voice disorders; Machine learning; Speech acoustics; Acoustic Voice Quality Index; Cepstral Peak Prominence; DYSPHONIA SEVERITY; SPEECH; VALIDATION; PITCH;
D O I
10.1159/000544852
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Introduction: The Acoustic Voice Quality Index (AVQI) and Smoothed Cepstral Peak Prominence (CPPs) have been reported to effectively support the assessment of voice quality in persons seeking voice care across many languages. This study aimed to evaluate the diagnostic accuracy of these two measures in detecting voice disorders in American English speakers, comparing their performance to machine learning (ML) models. Methods: This retrospective study included a cohort of 187 participants: 138 patients with clinically diagnosed voice disorders and 49 vocally healthy individuals. Each participant completed two voicing tasks: sustaining [a:] vowel and producing a running speech sample, which were then concatenated. These samples were analyzed using VOXplot software for AVQI-3 (version 03.01) and CPPs. Additionally, four ML models (random forest, k-nearest neighbors, support vector machine, and decision tree) were trained for comparison. The diagnostic accuracy of the two measures and models was assessed using various evaluation metrics, including receiver operating characteristic curve and Youden Index. Results: A cutoff score of 1.54 for the AVQI-3 (with 55% sensitivity and 80% specificity) and 14.35 dB for CPPs (with 65% sensitivity and 78% specificity) were identified for detecting voice disorders. Compared to an average ML sensitivity of 89% and specificity of 55%, CPPs offered a better balance between sensitivity and specificity, outperforming AVQI-3 and nearly matching the average ML performance. Conclusions: ML shows great potential for supporting voice disorder diagnostics, especially as models become more generalizable and easier to interpret. However, current tools like AVQI-3 and CPPs remain more practical and accessible for clinical use in evaluating voice quality than commonly implemented models. CPPs, in particular, offers distinct advantages for identifying voice disorders, making it a recommended and feasible choice for clinics with limited resources.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] A Comprehensive Application for Grading Severity of Voice Based on Acoustic Voice Quality Index v.02.03
    Shabnam, Srushti
    Pushpavathi, M.
    Sankar, R. Gopi
    Sridharan, Karthik Venkat
    Vasanthalakshmi, M. S.
    JOURNAL OF VOICE, 2025, 39 (01) : 287e1 - 287e9
  • [22] Predicting Voice Disorder Status From Smoothed Measures of Cepstral Peak Prominence Using Praat and Analysis of Dysphonia in Speech and Voice (ADSV)
    Sauder, Cara
    Bretl, Michelle
    Eadie, Tanya
    JOURNAL OF VOICE, 2017, 31 (05) : 557 - 566
  • [23] The Performance of the Acoustic Voice Quality Index and Acoustic Breathiness Index in Synthesized Voices
    von Latoszek, Ben Barsties
    Englert, Marina
    Lucero, Jorge C.
    Behlau, Mara
    JOURNAL OF VOICE, 2023, 37 (05) : 804.e21 - 804.e28
  • [24] A Case of Specificity: How Does the Acoustic Voice Quality Index Perform in Normophonic Subjects?
    Batthyany, Christina
    Maryn, Youri
    Trauwaen, Ilse
    Caelenberghe, Els
    van Dinther, Joost
    Zarowski, Andrzej
    Wuyts, Floris
    APPLIED SCIENCES-BASEL, 2019, 9 (12):
  • [25] Exploring the Relationship Between Spectral and Cepstral Measures of Voice and the Voice Handicap Index (VHI)
    Awan, Shaheen N.
    Roy, Nelson
    Cohen, Seth M.
    JOURNAL OF VOICE, 2014, 28 (04) : 430 - 439
  • [26] What Makes the Cepstral Peak Prominence Different to Other Acoustic Correlates of Vocal Quality?
    Riesgo, Carlos A. Ferrer
    Noeth, Elmar
    JOURNAL OF VOICE, 2020, 34 (05)
  • [27] Validation of the Acoustic Voice Quality Index Version 03.01 and Acoustic Breathiness Index in German
    Latoszek, Ben Barsties, V
    Lehnert, Bernhard
    Janotte, Ben
    JOURNAL OF VOICE, 2020, 34 (01) : 157.e17 - 157.e25
  • [28] Influence of the Voice Sample Length in Perceptual and Acoustic Voice Quality Analysis
    Englert, Marina
    Lima, Livia
    Latoszek, Ben Barsties, V
    Behlau, Mara
    JOURNAL OF VOICE, 2022, 36 (04) : 582.e23 - 582.e32
  • [29] Development of a machine-learning based voice disorder screening tool
    Reid, Jonathan
    Parmar, Preet
    Lund, Tyler
    Aalto, Daniel K.
    Jeffery, Caroline C.
    AMERICAN JOURNAL OF OTOLARYNGOLOGY, 2022, 43 (02)
  • [30] Cepstral measures in the assessment of severity of voice disorders
    Lopes, Leonardo Wanderley
    da Silva Sousa, Estevao Silvestre
    Franca da Silva, Allan Carlos
    da Silva, Itacely Marinho
    Alves de Paiva, Maxsuel Avelino
    Dias Vieira, Vinicius Jefferson
    Almeida, Anna Alice
    CODAS, 2019, 31 (04):