Leveraging large language models for automated detection of velopharyngeal dysfunction in patients with cleft palate

被引:0
作者
Shirk, Myranda Uselton [1 ]
Dang, Catherine [1 ]
Cho, Jaewoo [1 ]
Chen, Hanlin [1 ]
Hofstetter, Lily [1 ]
Bijur, Jack [1 ]
Lucas, Claiborne [2 ]
James, Andrew [3 ]
Guzman, Ricardo-Torres [3 ]
Hiller, Andrea [3 ]
Alter, Noah [3 ]
Stone, Amy [4 ]
Powell, Maria [4 ]
Pontell, Matthew E. [3 ,5 ]
机构
[1] Vanderbilt Univ, Data Sci Inst, Nashville, TN USA
[2] Prisma Hlth Greenville, Dept Gen Surg, Greenville, SC USA
[3] Vanderbilt Univ, Med Ctr, Dept Plast Surg, Nashville, TN 37232 USA
[4] Vanderbilt Univ, Med Ctr, Dept Otolaryngol, Nashville, TN USA
[5] Monroe Carell Jr Childrens Hosp, Div Pediat Plast Surg, Nashville, TN 37232 USA
来源
FRONTIERS IN DIGITAL HEALTH | 2025年 / 7卷
关键词
velopharyngeal dysfunction (VPD); hypernasality detection; artificial intelligence (AI); cleft palate; machine learning (ML); speech diagnostics; QUALITY-OF-LIFE; HEALTH-CARE; INSUFFICIENCY; ASSOCIATION; GENETICS; CHILDREN;
D O I
10.3389/fdgth.2025.1552746
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background Hypernasality, a hallmark of velopharyngeal insufficiency (VPI), is a speech disorder with significant psychosocial and functional implications. Conventional diagnostic methods rely heavily on specialized expertise and equipment, posing challenges in resource-limited settings. This study explores the application of OpenAI's Whisper model for automated hypernasality detection, offering a scalable and efficient alternative to traditional approaches.Methods The Whisper model was adapted for binary classification by replacing its sequence-to-sequence decoder with a custom classification head. A dataset of 184 audio recordings, including 96 hypernasal (cases) and 88 non-hypernasal samples (controls), was used for training and evaluation. The Whisper model's performance was compared to traditional machine learning approaches, including support vector machines (SVM) and random forest (RF) classifiers.Results The Whisper-based model effectively detected hypernasality in speech, achieving a test accuracy of 97% and an F1-score of 0.97. It significantly outperformed SVM and RF classifiers, which achieved accuracies of 88.1% and 85.7%, respectively. Whisper demonstrated robust performance across diverse recording conditions and required minimal training data, showcasing its scalability and efficiency for hypernasality detection.Conclusion This study demonstrates the effectiveness of the Whisper-based model for hypernasality detection. By providing a reliable pretest probability, the Whisper model can serve as a triaging mechanism to prioritize patients for further evaluation, reducing diagnostic delays and optimizing resource allocation.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Analysis of patients with a cleft of the soft palate with special consideration to the problem of velopharyngeal insufficiency
    Schuster, T.
    Rustemeyer, J.
    Bremerich, A.
    Guenther, L.
    Schwenzer-Zimmerer, K.
    JOURNAL OF CRANIO-MAXILLOFACIAL SURGERY, 2013, 41 (03) : 245 - 248
  • [32] Relationship of Velopharyngeal Insufficiency With Face Mask Therapy in Patients With Cleft Lip and Palate
    Helal, Narmin
    Ford, Matthew
    Basri, Osama
    Schuster, Lindsay
    Martin, Brian
    Losee, Joseph
    CLEFT PALATE-CRANIOFACIAL JOURNAL, 2020, 57 (01) : 118 - 122
  • [33] Velopharyngeal Outcomes After Palatoplasty for Patients With U-Shaped Cleft Palate and Pierre Robin Sequence
    Long, Katherine D.
    Lee, Koeun
    Ji, Jenny
    Skolnick, Gary B.
    Grames, Lynn M.
    Zanaboni, Hope
    Stephens, Kathryn
    Naidoo, Sybill D.
    Snyder-Warwick, Alison K.
    Patel, Kamlesh B.
    FACE, 2025,
  • [34] Update on using buccal myomucosal flaps for patients with cleft palate and velopharyngeal insufficiency: primary and secondary interventions
    Marston, Alexander P.
    Tollefson, Travis T.
    CURRENT OPINION IN OTOLARYNGOLOGY & HEAD AND NECK SURGERY, 2024, 32 (04) : 239 - 247
  • [35] Three-dimensional analysis of the velopharyngeal region in patients with cleft palate and healthy individuals
    Miller, Simone
    Neuhaus, Michael-Tobias
    Zimmerer, Ruediger
    Tavassol, Frank
    Gellrich, Nils-Claudius
    Ptok, Martin
    Jungheim, Michael
    SURGICAL AND RADIOLOGIC ANATOMY, 2020, 42 (09) : 1033 - 1042
  • [36] A Preliminary Study on the Characteristics of the Velopharyngeal Structures in Different-Age Patients With Cleft Palate
    Ma, Li
    Shi, Bing
    Li, Yang
    Zheng, Qian
    JOURNAL OF CRANIOFACIAL SURGERY, 2013, 24 (04) : 1235 - 1238
  • [37] Analysis of the correlative factors for velopharyngeal closure of patients with cleft palate after primary repair
    Chen, Qi
    Li, Yang
    Shi, Bing
    Yin, Heng
    Zheng, Guang-Ning
    Zheng, Qian
    ORAL SURGERY ORAL MEDICINE ORAL PATHOLOGY ORAL RADIOLOGY, 2013, 116 (06): : E424 - E428
  • [38] Speech Outcomes Comparison Between Adult Velopharyngeal Insufficiency and Patients With Unrepaired Cleft Palate
    Lou, Qun
    Wang, Xudong
    Chen, Yang
    JOURNAL OF CRANIOFACIAL SURGERY, 2021, 32 (02) : 655 - 659
  • [39] Three-dimensional analysis of the velopharyngeal region in patients with cleft palate and healthy individuals
    Simone Miller
    Michael-Tobias Neuhaus
    Rüdiger Zimmerer
    Frank Tavassol
    Nils-Claudius Gellrich
    Martin Ptok
    Michael Jungheim
    Surgical and Radiologic Anatomy, 2020, 42 : 1033 - 1042
  • [40] The Correlation Between Consonant Articulation and Velopharyngeal Function in Patients With Unoperated Submucous Cleft Palate
    Zhang, Bei
    Guo, Chunli
    Yin, Heng
    Zheng, Qian
    Shi, Bing
    Li, Jingtao
    JOURNAL OF CRANIOFACIAL SURGERY, 2020, 31 (04) : 1070 - 1073