Development and Comparison of Machine Learning and Deep Learning Models for Speech Audiometry Prediction

被引:0
作者
Shin, Jae sung [1 ]
Ma, Jun [1 ]
Makara, Mao [1 ]
Sung, Nak-Jun [2 ]
Choi, Seong Jun [3 ]
Kim, Sung yeup [4 ]
Hong, Min [5 ]
机构
[1] Soonchunhyang Univ, Dept Software Convergence, Asan 31538, South Korea
[2] Natl Canc Ctr, Res Inst, Goyang 10245, South Korea
[3] Soonchunhyang Univ, Coll Med, Cheonan Hosp, Dept Otorhinolaryngol Head & Neck Surg, Cheonan 31151, South Korea
[4] Soonchunhyang Univ, Insitute Artificial Intelligence & Software, Asan 31538, South Korea
[5] Soonchunhyang Univ, Dept Comp Software Engn, Asan 31538, South Korea
来源
APPLIED SCIENCES-BASEL | 2025年 / 15卷 / 06期
关键词
speech audiometry; pure-tone audiometry; multilayer perceptron; recurrent neural network; gradient boosting; XGBoost; regression analysis in audiology; deep learning; machine learning; hearing assessment; PURE-TONE AUDIOMETRY;
D O I
10.3390/app15063071
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Hearing loss significantly impacts daily communication, making accurate speech audiometry (SA) assessment essential for diagnosis and treatment. However, SA testing is time-consuming and resource-intensive, limiting its accessibility in clinical practice. This study aimed to develop a multi-class classification model that predicts SA results using pure-tone audiometry (PTA) data, enabling a more efficient and automated assessment. To achieve this, we implemented and compared MLP, RNN, gradient boosting, and XGBoost models, evaluating their performance using accuracy, F1 score, log loss, and confusion matrix analysis. Experimental results showed that gradient boosting achieved the highest accuracy, 86.22%, while XGBoost demonstrated a more balanced classification performance. The MLP achieved 85.77% and the RNN achieved 85.41%, exhibiting relatively low accuracy, with the RNN showing limitations due to the low temporal dependency of PTA data. Additionally, all models faced challenges predicting class 2 (borderline hearing levels) due to overlapping data distributions. These findings suggest that machine learning models, particularly gradient boosting and XGBoost, outperform deep learning models in SA prediction. Future research should focus on feature engineering, hyperparameter optimization, and ensemble approaches to enhance performance and validate real-world applicability. The proposed model could contribute to automating SA prediction and improving hearing assessment efficiency and patient care.
引用
收藏
页数:22
相关论文
共 33 条
  • [1] Comparing pure-tone audiometry and auditory steady state response for the measurement of hearing loss
    Ahn, Joong Ho
    Lee, Hyo-Sook
    Kim, Young-Jin
    Yoon, Tae Hyun
    Chung, Jong Woo
    [J]. OTOLARYNGOLOGY-HEAD AND NECK SURGERY, 2007, 136 (06) : 966 - 971
  • [2] The state of the art of deep learning models in medical science and their challenges
    Bhatt, Chandradeep
    Kumar, Indrajeet
    Vijayakumar, V.
    Singh, Kamred Udham
    Kumar, Abhishek
    [J]. MULTIMEDIA SYSTEMS, 2021, 27 (04) : 599 - 613
  • [3] Influence of the type of acoustic transducer in pure-tone audiometry
    Born, Nathalia Montandon
    Marciano, Mariane da Silva
    Mass, Stephanie da Costa
    Camargo da Silva, Daniela Polo
    Scharlach, Renata Coelho
    [J]. CODAS, 2022, 34 (03):
  • [4] Brandt JP, 2023, StatPearls
  • [5] A systematic review of relational-based therapies for the treatment of auditory hallucinations in patients with psychotic disorders
    Dellazizzo, Laura
    Giguere, Sabrina
    Leveille, Nayla
    Potvin, Stephane
    Dumais, Alexandre
    [J]. PSYCHOLOGICAL MEDICINE, 2022, 52 (11) : 2001 - 2008
  • [6] Effect of Digital Noise Reduction in Hearing Aids on Speech Intelligibility in Both Quiet and Noisy Environments
    Deniz, Burcu
    Gulmez, Zuleyha D.
    Kara, Halide
    Kara, Eyyup
    [J]. NOISE & HEALTH, 2024, 26 (121) : 220 - 225
  • [7] DeRuiter M., 2021, Basic Audiometry Learning Manual
  • [8] The Benefit of Air Conduction Pure-Tone Audiometry as a Screening Method for Hearing Loss over the VAS Score
    Giotakis, Aris I.
    Mariolis, Lambros
    Koulentis, Ioannis
    Mpoutris, Christos
    Giotakis, Evangelos I.
    Apostolopoulou, Aikaterini
    Papaefstathiou, Efstathios
    [J]. DIAGNOSTICS, 2024, 14 (01)
  • [9] Hoff M, 2024, INT J AUDIOL, V63, P622, DOI [10.1080/14992027.2023.2220909, 10.1109/INFOCOMWKSHPS57453.2023.10226043]
  • [10] Factors Underlying Individual Differences in Speech-Recognition Threshold (SRT) in Noise Among Older Adults
    Humes, Larry E.
    [J]. FRONTIERS IN AGING NEUROSCIENCE, 2021, 13