Diagnosis of Tympanic Membrane Disease and Pediatric Hearing Using Convolutional Neural Network Models with Multi-Layer Perceptrons

被引:1
作者
Lee, Hongchang [1 ]
Jang, Hyeonung [1 ]
Jeon, Wangsu [2 ]
Choi, Seongjun [3 ]
机构
[1] Haewootech Co Ltd, Busan 46742, South Korea
[2] Kyungnam Univ, Dept Comp Engn, Chang Won 51767, South Korea
[3] Soonchunhyang Univ, Cheonan Hosp, Coll Med, Dept Otolaryngol Head & Neck Surg, Cheonan 31151, South Korea
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 13期
关键词
deep learning; eardrum; ear diseases; EfficientNet; multi-class classification; multi-layer perceptron; pediatric hearing; regression; tympanic membrane; OTITIS-MEDIA; EFFUSION;
D O I
10.3390/app14135457
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In this study, we propose a method of classification for tympanic membrane diseases and regression of pediatric hearing, using a deep learning model of artificial neural networks. Based on the B7 Backbone model of EfficientNet, a state-of-the-art convolutional neural network model, drop connect was applied in the encoder for generalization, and multi-layer perceptron, which is mainly used in the transformer, was applied to the decoder for improved accuracy. For the training data, the open-access tympanic membrane dataset, divided into four classes, was used as the benchmark dataset, and the SCH tympanic membrane dataset with five classes of tympanic membrane diseases and pediatric hearing was also used as the training dataset. In the benchmark using the open-access tympanic membrane dataset, the proposed model showed the highest performance among the five comparative models with an average accuracy of 93.59%, an average sensitivity of 87.19%, and an average specificity of 95.73%. In the experiment trained on the SCH tympanic membrane disease dataset, the average accuracy was 98.28%, the average sensitivity was 89.66%, the average specificity was 98.68%, and the average inference time was 0.2 s. In the experiment trained on the SCH pediatric hearing dataset, the mean absolute error was 6.8678, the mean squared logarithmic error was 0.2887, and the average inference time was 0.2 s.
引用
收藏
页数:20
相关论文
共 39 条
[1]  
Bengio Y., 2006, Advances in Neural Information Processing Systems, V19
[2]   Randaugment: Practical automated data augmentation with a reduced search space [J].
Cubuk, Ekin D. ;
Zoph, Barret ;
Shlens, Jonathon ;
Le, Quoc, V .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, :3008-3017
[3]   A fast learning algorithm for deep belief nets [J].
Hinton, Geoffrey E. ;
Osindero, Simon ;
Teh, Yee-Whye .
NEURAL COMPUTATION, 2006, 18 (07) :1527-1554
[4]   Searching for MobileNetV3 [J].
Howard, Andrew ;
Sandler, Mark ;
Chu, Grace ;
Chen, Liang-Chieh ;
Chen, Bo ;
Tan, Mingxing ;
Wang, Weijun ;
Zhu, Yukun ;
Pang, Ruoming ;
Vasudevan, Vijay ;
Le, Quoc V. ;
Adam, Hartwig .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1314-1324
[5]   ROBUST ESTIMATION OF LOCATION PARAMETER [J].
HUBER, PJ .
ANNALS OF MATHEMATICAL STATISTICS, 1964, 35 (01) :73-&
[6]  
Ioffe Sergey, 2015, Proceedings of Machine Learning Research, V37, P448, DOI DOI 10.48550/ARXIV.1502.03167
[7]  
kaggle, Open-Access Tympanic Membrane Dataset
[8]  
Kingma Diederik P, 2014, ARXIV PREPRINT ARXIV
[9]   The aetiology of otitis media with effusion: a review [J].
Kubba, H ;
Pearson, JP ;
Birchall, JP .
CLINICAL OTOLARYNGOLOGY, 2000, 25 (03) :181-194
[10]   Backpropagation Applied to Handwritten Zip Code Recognition [J].
LeCun, Y. ;
Boser, B. ;
Denker, J. S. ;
Henderson, D. ;
Howard, R. E. ;
Hubbard, W. ;
Jackel, L. D. .
NEURAL COMPUTATION, 1989, 1 (04) :541-551