Enhancing intra-aural disease classification with attention-based deep learning models

被引：0

作者：

Furkancan Demircan ^{[1
]}

Murat Ekinci ^{[2
]}

Zafer Cömert ^{[1
]}

机构：

[1] Software Engineering, Faculty of Engineering and Natural Sciences, Samsun University, 19 Mayıs, Samsun

[2] Computer Engineering, Faculty of Engineering, Karadeniz Technical University, Trabzon, Ortahisar

[3] Department of Technical Sciences of the Western Caspian University, Baku

来源：

Neural Computing and Applications | 2025年 / 37卷 / 9期

关键词：

Classification; Deep learning; Ear diseases; Machine learning; Transformers;

D O I：

10.1007/s00521-025-10990-4

中图分类号：

学科分类号：

摘要：

Ear diseases are defined as pathological conditions that indicate dysfunction or abnormal function of the ear organ, which is part of the auditory system of living organisms that regulates hearing and balance functions. These diseases usually manifest as conditions that affect the internal components of the ear structure and can manifest themselves with symptoms such as hearing loss, ear pain, balance problems, and fluid accumulation in the ear. The accuracy of the diagnosis depends on expert knowledge and subjective opinion. This method is prone to human error. This study presents a novel computer-aided diagnosis system for otoscope images of ear diseases, utilizing a vision transformer-based feature extractor combined with machine learning classifiers to provide accurate second opinions for ENT specialists. For this purpose, a new model based on state-of-the-art vision transformer feature extractor and machine learning models is proposed. In the experimental study, the dataset, comprising 880 eardrum images categorized into four classes (CSOM, earwax, myringosclerosis, and normal), was split into training (70%), validation (10%), and testing (20%) subsets. Each image was preprocessed to 420 × 380 pixels to fit the input dimensions of the models. The vision transformer architecture was utilized for feature extraction, followed by classification using various machine learning algorithms including kNN, SVM, and random forest. As a result, the model using vision transformer feature extractor and k-nearest neighbors (kNN) algorithm achieved 99.00% accuracy. In this study, a deep learning-based and computer-aided diagnosis system, in other words, a computational model, was developed instead of the current human error-prone disease diagnosis method used by ear nose throat (ENT) specialists. The main purpose of the deep learning-based decision support system is to support the diagnosis process where expert knowledge is difficult to access and to provide an alternative opinion to the expert diagnosis. © The Author(s) 2025.

引用

页码：6601 / 6616

页数：15

共 30 条

[1] Connolly K., Gonzalez-Cordero A., Modelling inner ear development and disease using pluripotent stem cells - a pathway to new therapeutic strategies, DMM Disease Model Mech, (2022)
[2] Primary ear and hearing care training manual, (2023)
[3] Sutton R.T., Pincock D., Baumgart D.C., Et al., An overview of clinical decision support systems: benefits, risks, and strategies for success, NPJ Digit Med, (2020)
[4] Wu Z., Lin Z., Li L., Et al., Deep learning for classification of pediatric otitis media, Laryngoscope, 131, pp. E2344-E2351, (2021)
[5] Sundgaard J.V., Harte J., Bray P., Et al., Deep metric learning for otitis media classification, Med Image Anal, (2021)
[6] Alhudhaif A., Comert Z., Polat K., Otitis media detection using tympanic membrane images with a novel multi-class machine learning algorithm, PeerJ Comput Sci, 7, pp. 1-22, (2021)
[7] Tran T.T., Fang T.Y., Pham V.T., Et al., Development of an automatic diagnostic algorithm for pediatric otitis media, Otol Neurotol, 39, pp. 1060-1065, (2018)
[8] Myburgh H.C., Jose S., Swanepoel D.W., Laurent C., Towards low cost automated smartphone- and cloud-based otitis media diagnosis, Biomed Signal Process Control, 39, pp. 34-52, (2018)
[9] Mohammed K.K., Hassanien A.E., Afify H.M., Classification of ear imagery database using bayesian optimization based on CNN-LSTM architecture, J Digit Imaging, 35, pp. 947-961, (2022)
[10] Ucar M., Akyol K., Atila U.E., Classification of different tympanic membrane conditions using fused deep hypercolumn features and bidirectional LSTM, IRBM, 43, pp. 187-197, (2022)

← 1 2 3 →