Development and Comparison of Machine Learning and Deep Learning Models for Speech Audiometry Prediction

被引：0

作者：

Shin, Jae sung ^{[1
]}

Ma, Jun ^{[1
]}

Makara, Mao ^{[1
]}

Sung, Nak-Jun ^{[2
]}

Choi, Seong Jun ^{[3
]}

Kim, Sung yeup ^{[4
]}

Hong, Min ^{[5
]}

机构：

[1] Soonchunhyang Univ, Dept Software Convergence, Asan 31538, South Korea

[2] Natl Canc Ctr, Res Inst, Goyang 10245, South Korea

[3] Soonchunhyang Univ, Coll Med, Cheonan Hosp, Dept Otorhinolaryngol Head & Neck Surg, Cheonan 31151, South Korea

[4] Soonchunhyang Univ, Insitute Artificial Intelligence & Software, Asan 31538, South Korea

[5] Soonchunhyang Univ, Dept Comp Software Engn, Asan 31538, South Korea

来源：

APPLIED SCIENCES-BASEL | 2025年 / 15卷 / 06期

关键词：

speech audiometry; pure-tone audiometry; multilayer perceptron; recurrent neural network; gradient boosting; XGBoost; regression analysis in audiology; deep learning; machine learning; hearing assessment; PURE-TONE AUDIOMETRY;

D O I：

10.3390/app15063071

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

Hearing loss significantly impacts daily communication, making accurate speech audiometry (SA) assessment essential for diagnosis and treatment. However, SA testing is time-consuming and resource-intensive, limiting its accessibility in clinical practice. This study aimed to develop a multi-class classification model that predicts SA results using pure-tone audiometry (PTA) data, enabling a more efficient and automated assessment. To achieve this, we implemented and compared MLP, RNN, gradient boosting, and XGBoost models, evaluating their performance using accuracy, F1 score, log loss, and confusion matrix analysis. Experimental results showed that gradient boosting achieved the highest accuracy, 86.22%, while XGBoost demonstrated a more balanced classification performance. The MLP achieved 85.77% and the RNN achieved 85.41%, exhibiting relatively low accuracy, with the RNN showing limitations due to the low temporal dependency of PTA data. Additionally, all models faced challenges predicting class 2 (borderline hearing levels) due to overlapping data distributions. These findings suggest that machine learning models, particularly gradient boosting and XGBoost, outperform deep learning models in SA prediction. Future research should focus on feature engineering, hyperparameter optimization, and ensemble approaches to enhance performance and validate real-world applicability. The proposed model could contribute to automating SA prediction and improving hearing assessment efficiency and patient care.

引用

页数：22

共 33 条

[31] Performance and Reliability Evaluation of an Automated Bone-Conduction Audiometry Using Machine Learning [J].

Wallaert, Nicolas ;

Perry, Antoine ;

Jean, Hadrien ;

Creff, Gwenaelle ;

Godey, Benoit ;

Paraouty, Nihaad .

TRENDS IN HEARING, 2024, 28

[32] Simplified frequency selectivity measure as a potential candidate for hearing screening: changes with masker level and test-retest reliability of self-administered testing [J].

Wang, Xinyang ;

Rasidi, Wan Nur Asyiqin ;

Seluakumaran, Kumar .

INTERNATIONAL JOURNAL OF AUDIOLOGY, 2024,

[33]

Zhao F., 2021, Manual of Clinical Phonetics, P444

← 1 2 3 4 →