Leveraging large language models for automated detection of velopharyngeal dysfunction in patients with cleft palate

被引：0

作者：

Shirk, Myranda Uselton ^{[1
]}

Dang, Catherine ^{[1
]}

Cho, Jaewoo ^{[1
]}

Chen, Hanlin ^{[1
]}

Hofstetter, Lily ^{[1
]}

Bijur, Jack ^{[1
]}

Lucas, Claiborne ^{[2
]}

James, Andrew ^{[3
]}

Guzman, Ricardo-Torres ^{[3
]}

Hiller, Andrea ^{[3
]}

Alter, Noah ^{[3
]}

Stone, Amy ^{[4
]}

Powell, Maria ^{[4
]}

Pontell, Matthew E. ^{[3
,5
]}

机构：

[1] Vanderbilt Univ, Data Sci Inst, Nashville, TN USA

[2] Prisma Hlth Greenville, Dept Gen Surg, Greenville, SC USA

[3] Vanderbilt Univ, Med Ctr, Dept Plast Surg, Nashville, TN 37232 USA

[4] Vanderbilt Univ, Med Ctr, Dept Otolaryngol, Nashville, TN USA

[5] Monroe Carell Jr Childrens Hosp, Div Pediat Plast Surg, Nashville, TN 37232 USA

来源：

FRONTIERS IN DIGITAL HEALTH | 2025年 / 7卷

关键词：

velopharyngeal dysfunction (VPD); hypernasality detection; artificial intelligence (AI); cleft palate; machine learning (ML); speech diagnostics; QUALITY-OF-LIFE; HEALTH-CARE; INSUFFICIENCY; ASSOCIATION; GENETICS; CHILDREN;

D O I：

10.3389/fdgth.2025.1552746

中图分类号：

R19 [保健组织与事业（卫生事业管理）];

学科分类号：

摘要：

Background Hypernasality, a hallmark of velopharyngeal insufficiency (VPI), is a speech disorder with significant psychosocial and functional implications. Conventional diagnostic methods rely heavily on specialized expertise and equipment, posing challenges in resource-limited settings. This study explores the application of OpenAI's Whisper model for automated hypernasality detection, offering a scalable and efficient alternative to traditional approaches.Methods The Whisper model was adapted for binary classification by replacing its sequence-to-sequence decoder with a custom classification head. A dataset of 184 audio recordings, including 96 hypernasal (cases) and 88 non-hypernasal samples (controls), was used for training and evaluation. The Whisper model's performance was compared to traditional machine learning approaches, including support vector machines (SVM) and random forest (RF) classifiers.Results The Whisper-based model effectively detected hypernasality in speech, achieving a test accuracy of 97% and an F1-score of 0.97. It significantly outperformed SVM and RF classifiers, which achieved accuracies of 88.1% and 85.7%, respectively. Whisper demonstrated robust performance across diverse recording conditions and required minimal training data, showcasing its scalability and efficiency for hypernasality detection.Conclusion This study demonstrates the effectiveness of the Whisper-based model for hypernasality detection. By providing a reliable pretest probability, the Whisper model can serve as a triaging mechanism to prioritize patients for further evaluation, reducing diagnostic delays and optimizing resource allocation.

引用

页数：8

共 50 条

[41] Velopharyngeal incompetence in patients with cleft palate, flexible video pharyngoscopy and perceptual speech assessment: a correlational pilot study
Rajan, S.
Kurien, M.
Gupta, A. K.
Mathews, S. S.
Albert, R. R.
Tychicus, D.
[J]. JOURNAL OF LARYNGOLOGY AND OTOLOGY, 2014, 128 (11) : 986 - 990
[42] Velopharyngeal Insufficiency Rates After Delayed Cleft Palate Repair Lessons Learned From Internationally Adopted Patients
Follmar, Keith E.
Yuan, Nance
Pendleton, Courtney S.
Dorafshar, Amir H.
Kolk, Craig Vander
Redett, Richard J., III
[J]. ANNALS OF PLASTIC SURGERY, 2015, 75 (03) : 302 - 305
[43] Effect of maxillary advancement on speech and velopharyngeal function of patients with cleft palate: Systematic Review and Meta-Analysis
Sales, P. H. H.
Costa, F. W. G.
Cetira Filho, E. L.
Silva, P. G. B.
Albuquerque, A. F. M.
Leao, J. C.
[J]. INTERNATIONAL JOURNAL OF ORAL AND MAXILLOFACIAL SURGERY, 2021, 50 (01) : 64 - 74
[44] Speech Outcomes Following Operative Management of Velopharyngeal Dysfunction (VPD) in Non-Syndromic Post-Palatoplasty Cleft Palate Patients
Kimia, Rotem
Solot, Cynthia B. B.
McCormack, Susan M. M.
Cohen, Marilyn
Blum, Jessica D. D.
Villavisanis, Dillan F. F.
Vora, Nisha
Valenzuela, Zachary
Taylor, Jesse A. A.
Low, David W. W.
Jackson, Oksana A. A.
[J]. CLEFT PALATE CRANIOFACIAL JOURNAL, 2024, 61 (06) : 1007 - 1017
[45] Impact of Orofacial Dysfunction on the Quality of Life of Adult Patients With Cleft Lip and Palate
Reinaldo Mariano, Natalia Cristina
Sano, Mariana Naomi
Curvello, Victor Prado
Pompeia Fraga de Almeida, Ana Lucia
Neppelenbroek, Karin Hermana
Oliveira, Thais Marchini
Soares, Simone
[J]. CLEFT PALATE-CRANIOFACIAL JOURNAL, 2018, 55 (08) : 1138 - 1144
[46] Factors affecting articulation skills in children with velocardiofacial syndrome and children with cleft palate or velopharyngeal dysfunction: A preliminary report
Baylis, Adriane L.
Munson, Benjamin
Moller, Karlind T.
[J]. CLEFT PALATE-CRANIOFACIAL JOURNAL, 2008, 45 (02) : 193 - 207
[47] Management of Velopharyngeal Dysfunction (VPD) Following Cleft Palate Repair: A Comprehensive Decision-Making Process Based on Severity and Structural Deficiencies
Hussain, Syed Altaf
Vijayakumar, Charanya
Balasubramanian, Subramaniyan
Rahavi-Ezabadi, Sara
Sundar, Vishnu
Sybil, Deborah
Hussain, Zaid
[J]. CLEFT PALATE CRANIOFACIAL JOURNAL, 2024,
[48] Nasendoscopic Findings of Velopharyngeal Sphincter in Operated Cleft Palate Patients: Is It Different than Normal Population
Sharma, Akangsha
Sahu, Shamendra Anand
Agrawal, Karoon
[J]. INDIAN JOURNAL OF PLASTIC SURGERY, 2019, 52 (02) : 178 - 182
[49] Endoscopic evaluation of velopharyngeal function in cleft lip palate patients-A correlation with speech analysis
Mangal, Mayank
Kumar, Parmod
Munjal, Sanjay
Sharma, Ramesh Kumar
[J]. JOURNAL OF PLASTIC RECONSTRUCTIVE AND AESTHETIC SURGERY, 2023, 77 : 170 - 176
[50] Palatal Re-Repair With Z-Plasty in Treatment of Velopharyngeal Insufficiency of Syndromic and Nonsyndromic Patients With Cleft Palate
Ahti, Veera
Alaluusua, Suvi
Vuola, Pia
Rautio, Jorma
Leikola, Junnu
Saarikko, Anne
[J]. JOURNAL OF CRANIOFACIAL SURGERY, 2021, 32 (02) : 685 - 690

← 1 2 3 4 5 →