Investigating the use of a two-stage attention-aware convolutional neural network for the automated diagnosis of otitis media from tympanic membrane images: a prediction model development and validation study

被引:24
作者
Cai, Yuexin [1 ,2 ]
Yu, Jin-Gang [3 ]
Chen, Yuebo [1 ,2 ]
Liu, Chu [1 ,2 ]
Xiao, Lichao [3 ]
Grais, Emad M. [4 ]
Zhao, Fei [4 ]
Lan, Liping [1 ,2 ]
Zeng, Shengxin [1 ,2 ]
Zeng, Junbo [1 ,2 ]
Wu, Minjian [1 ,2 ]
Su, Yuejia [1 ,2 ]
Li, Yuanqing [3 ]
Zheng, Yiqing [1 ,2 ]
机构
[1] Sun Yat Sen Univ, Sun Yat Sen Mem Hosp, Dept Otolaryngol, Guangzhou, Guangdong, Peoples R China
[2] Sun Yat Sen Univ, Inst Hearing & Speech Language Sci, Guangzhou, Guangdong, Peoples R China
[3] South China Univ Technol Sch, Dept Automat Sci & Engn, Guangzhou, Guangdong, Peoples R China
[4] Cardiff Metropolitan Univ, Cardiff Sch Sport & Hlth Sci, Ctr Speech & Language Therapy & Hearing Sci, Cardiff, Wales
基金
中国国家自然科学基金;
关键词
otolaryngology; endoscopic surgery; paediatric otolaryngology; adult otolaryngology; DEEP; SEGMENTATION;
D O I
10.1136/bmjopen-2020-041139
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Objectives This study investigated the usefulness and performance of a two-stage attention-aware convolutional neural network (CNN) for the automated diagnosis of otitis media from tympanic membrane (TM) images. Design A classification model development and validation study in ears with otitis media based on otoscopic TM images. Two commonly used CNNs were trained and evaluated on the dataset. On the basis of a Class Activation Map (CAM), a two-stage classification pipeline was developed to improve accuracy and reliability, and simulate an expert reading the TM images. Setting and participants This is a retrospective study using otoendoscopic images obtained from the Department of Otorhinolaryngology in China. A dataset was generated with 6066 otoscopic images from 2022 participants comprising four kinds of TM images, that is, normal eardrum, otitis media with effusion (OME) and two stages of chronic suppurative otitis media (CSOM). Results The proposed method achieved an overall accuracy of 93.4% using ResNet50 as the backbone network in a threefold cross-validation. The F1 Score of classification for normal images was 94.3%, and 96.8% for OME. There was a small difference between the active and inactive status of CSOM, achieving 91.7% and 82.4% F1 scores, respectively. The results demonstrate a classification performance equivalent to the diagnosis level of an associate professor in otolaryngology. Conclusions CNNs provide a useful and effective tool for the automated classification of TM images. In addition, having a weakly supervised method such as CAM can help the network focus on discriminative parts of the image and improve performance with a relatively small database. This two-stage method is beneficial to improve the accuracy of diagnosis of otitis media for junior otolaryngologists and physicians in other disciplines.
引用
收藏
页数:7
相关论文
共 28 条
[1]  
Acuin J, 2007, BMJ CLIN EVID, V2007
[2]  
[Anonymous], 2018, DIAGNOSE RADIOLOGIST
[3]  
Barry KM, 2015, MODULATION MEDIAL GE, P1
[4]   Context-aware stacked convolutional neural networks for classification of breast carcinomas in whole-slide histopathology images [J].
Bejnordi, Babak Ehteshami ;
Zuidhof, Guido ;
Balkenhol, Maschenka ;
Hermsen, Meyke ;
Bult, Peter ;
Van Ginneken, Bram ;
Karssemeijer, Nico ;
Litjens, Geert ;
Van Der Laak, Jeroen .
Journal of Medical Imaging, 2017, 4 (04)
[5]   Deep neural networks are superior to dermatologists in melanoma image classification [J].
Brinker, Titus J. ;
Hekler, Achim ;
Enk, Alexander H. ;
Berking, Carola ;
Haferkamp, Sebastian ;
Hauschild, Axel ;
Weichenthal, Michael ;
Klode, Joachim ;
Schadendorf, Dirk ;
Holland-Letz, Tim ;
von Kalle, Christof ;
Froehling, Stefan ;
Schilling, Bastian ;
Utikal, Jochen S. .
EUROPEAN JOURNAL OF CANCER, 2019, 119 :11-17
[6]   Automated diagnosis of ear disease using ensemble deep learning with a big otoendoscopy image database [J].
Cha, Dongchul ;
Pae, Chongwon ;
Seong, Si-Baek ;
Choi, Jae Young ;
Park, Hae-Jeong .
EBIOMEDICINE, 2019, 45 :606-614
[7]   DCAN: Deep contour-aware networks for object instance segmentation from histology images [J].
Chen, Hao ;
Qi, Xiaojuan ;
Yu, Lequan ;
Dou, Qi ;
Qin, Jing ;
Heng, Pheng-Ann .
MEDICAL IMAGE ANALYSIS, 2017, 36 :135-146
[8]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[9]   Neural Mechanisms of Selective Visual Attention [J].
Moore, Tirin ;
Zirnsak, Marc .
ANNUAL REVIEW OF PSYCHOLOGY, VOL 68, 2017, 68 :47-72
[10]   Dermatologist-level classification of skin cancer with deep neural networks (vol 542, pg 115, 2017) [J].
Esteva, Andre ;
Kuprel, Brett ;
Novoa, Roberto A. ;
Ko, Justin ;
Swetter, Susan M. ;
Blau, Helen M. ;
Thrun, Sebastian .
NATURE, 2017, 546 (7660) :686-686