Leveraging large language models for automated detection of velopharyngeal dysfunction in patients with cleft palate

被引:0
|
作者
Shirk, Myranda Uselton [1 ]
Dang, Catherine [1 ]
Cho, Jaewoo [1 ]
Chen, Hanlin [1 ]
Hofstetter, Lily [1 ]
Bijur, Jack [1 ]
Lucas, Claiborne [2 ]
James, Andrew [3 ]
Guzman, Ricardo-Torres [3 ]
Hiller, Andrea [3 ]
Alter, Noah [3 ]
Stone, Amy [4 ]
Powell, Maria [4 ]
Pontell, Matthew E. [3 ,5 ]
机构
[1] Vanderbilt Univ, Data Sci Inst, Nashville, TN USA
[2] Prisma Hlth Greenville, Dept Gen Surg, Greenville, SC USA
[3] Vanderbilt Univ, Med Ctr, Dept Plast Surg, Nashville, TN 37232 USA
[4] Vanderbilt Univ, Med Ctr, Dept Otolaryngol, Nashville, TN USA
[5] Monroe Carell Jr Childrens Hosp, Div Pediat Plast Surg, Nashville, TN 37232 USA
来源
FRONTIERS IN DIGITAL HEALTH | 2025年 / 7卷
关键词
velopharyngeal dysfunction (VPD); hypernasality detection; artificial intelligence (AI); cleft palate; machine learning (ML); speech diagnostics; QUALITY-OF-LIFE; HEALTH-CARE; INSUFFICIENCY; ASSOCIATION; GENETICS; CHILDREN;
D O I
10.3389/fdgth.2025.1552746
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background Hypernasality, a hallmark of velopharyngeal insufficiency (VPI), is a speech disorder with significant psychosocial and functional implications. Conventional diagnostic methods rely heavily on specialized expertise and equipment, posing challenges in resource-limited settings. This study explores the application of OpenAI's Whisper model for automated hypernasality detection, offering a scalable and efficient alternative to traditional approaches.Methods The Whisper model was adapted for binary classification by replacing its sequence-to-sequence decoder with a custom classification head. A dataset of 184 audio recordings, including 96 hypernasal (cases) and 88 non-hypernasal samples (controls), was used for training and evaluation. The Whisper model's performance was compared to traditional machine learning approaches, including support vector machines (SVM) and random forest (RF) classifiers.Results The Whisper-based model effectively detected hypernasality in speech, achieving a test accuracy of 97% and an F1-score of 0.97. It significantly outperformed SVM and RF classifiers, which achieved accuracies of 88.1% and 85.7%, respectively. Whisper demonstrated robust performance across diverse recording conditions and required minimal training data, showcasing its scalability and efficiency for hypernasality detection.Conclusion This study demonstrates the effectiveness of the Whisper-based model for hypernasality detection. By providing a reliable pretest probability, the Whisper model can serve as a triaging mechanism to prioritize patients for further evaluation, reducing diagnostic delays and optimizing resource allocation.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Different patterns of velopharyngeal dysfunction in cleft palate patients
    Brunner, M.
    Dockter, S.
    Feldhusen, F.
    Proeschel, U.
    Plinkert, P.
    Komposch, G.
    Muessig, E.
    HNO, 2007, 55 (11) : 851 - 857
  • [2] Effects of Velopharyngeal Dysfunction on Middle Ear of Repaired Cleft Palate Patients
    da Silva, Daniela Preto
    Martins Collares, Marcus Vinicius
    da Costa, Sady Selaimen
    CLEFT PALATE-CRANIOFACIAL JOURNAL, 2010, 47 (03) : 225 - 233
  • [3] The effect of nasopharyngoscopic biofeedback in patients with cleft palate and velopharyngeal dysfunction
    Brunner, M
    Stellzig-Eisenhauer, A
    Pröschel, U
    Verres, R
    Komposch, G
    CLEFT PALATE-CRANIOFACIAL JOURNAL, 2005, 42 (06) : 649 - 657
  • [4] Treatment algorithm for velopharyngeal dysfunction in patients with cleft palate: a systematic review
    Asar, Aseel
    Gaber, Ramy
    Yehia, Mahmoud
    El-Kassaby, Marwa A. W.
    BRITISH JOURNAL OF ORAL & MAXILLOFACIAL SURGERY, 2023, 61 (04) : 259 - 266
  • [5] Surgical treatment of velopharyngeal dysfunction: Incidence and associated factors in the Swedish cleft palate population
    Johansson, Malin Schaar
    Becker, Magnus
    Eriksson, Marie
    Stiernman, Mia
    Klinto, Kristina
    JOURNAL OF PLASTIC RECONSTRUCTIVE AND AESTHETIC SURGERY, 2024, 90 : 240 - 248
  • [6] Speech Outcomes Following Orticochea Pharyngoplasty in Patients With History of Cleft Palate and Noncleft Velopharyngeal Dysfunction
    Birch, Alison L.
    Jordan, Zoe, V
    Ferguson, Louisa M.
    Kelly, Clare B.
    Boorman, John G.
    CLEFT PALATE CRANIOFACIAL JOURNAL, 2022, 59 (03) : 277 - 290
  • [7] Cleft palate speech and velopharyngeal dysfunction: the approach of the speech therapist
    De Bodt, M.
    Van Lierde, K.
    B-ENT, 2006, : 63 - 70
  • [8] Preoperative Velopharyngeal Morphology in Older Cleft Palate Patients With Postoperative Velopharyngeal Closure Versus Velopharyngeal Insufficiency
    Ma, Li
    Zheng, Qian
    Li, Yang
    Ma, Hongfang
    Shi, Jiayu
    Shi, Bing
    JOURNAL OF CRANIOFACIAL SURGERY, 2013, 24 (05) : 1720 - 1723
  • [9] What's New in Cleft Palate and Velopharyngeal Dysfunction Management?
    Naran, Sanjay
    Ford, Matthew
    Losee, Joseph E.
    PLASTIC AND RECONSTRUCTIVE SURGERY, 2017, 139 (06) : 1343E - 1355E
  • [10] Comparison of velopharyngeal morphology of two palatoplasty techniques in patients with hard and soft cleft palate
    Fan, Xiaofen
    Liu, Weilong
    Nie, Jiancun
    Chen, Xiaoxuan
    Dong, Yingchun
    Lu, Yong
    FRONTIERS IN SURGERY, 2023, 9