Performance Comparison of the Deep Learning and the Human Endoscopist for Bleeding Peptic Ulcer Disease

被引:27
|
作者
Yen, Hsu-Heng [1 ,2 ,3 ]
Wu, Ping-Yu [3 ]
Su, Pei-Yuan [1 ]
Yang, Chia-Wei [1 ]
Chen, Yang-Yuan [1 ]
Chen, Mei-Fen [3 ,5 ]
Lin, Wen-Chen [5 ]
Tsai, Cheng-Lun [4 ,5 ]
Lin, Kang-Ping [3 ,5 ]
机构
[1] Changhua Christian Hosp, Div Gastroenterol, Dept Internal Med, Changhua, Taiwan
[2] Chien Kuo Technol Univ, Gen Educ Ctr, Changhua, Taiwan
[3] Chung Yuan Christian Univ, Dept Elect Engn, Taoyuan, Taiwan
[4] Chung Yuan Christian Univ, Dept Biomed Engn, Taoyuan, Taiwan
[5] Chung Yuan Christian Univ, Technol Translat Ctr Med Device, 200 Chung Pei Rd, Taoyuan 32023, Taiwan
关键词
Peptic ulcer; Bleeding; Deep learning; Artificial intelligence;
D O I
10.1007/s40846-021-00608-0
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Purpose Management of peptic ulcer bleeding is clinically challenging. Accurate characterization of the bleeding during endoscopy is key for endoscopic therapy. This study aimed to assess whether a deep learning model can aid in the classification of bleeding peptic ulcer disease. Methods Endoscopic still images of patients (n = 1694) with peptic ulcer bleeding for the last 5 years were retrieved and reviewed. Overall, 2289 images were collected for deep learning model training, and 449 images were validated for the performance test. Two expert endoscopists classified the images into different classes based on their appearance. Four deep learning models, including Mobile Net V2, VGG16, Inception V4, and ResNet50, were proposed and pre-trained by ImageNet with the established convolutional neural network algorithm. A comparison of the endoscopists and trained deep learning model was performed to evaluate the model's performance on a dataset of 449 testing images. Results The results first presented the performance comparisons of four deep learning models. The Mobile Net V2 presented the optimal performance of the proposal models. The Mobile Net V2 was chosen for further comparing the performance with the diagnostic results obtained by one senior and one novice endoscopists. The sensitivity and specificity were acceptable for the prediction of "normal" lesions in both 3-class and 4-class classifications. For the 3-class category, the sensitivity and specificity were 94.83% and 92.36%, respectively. For the 4-class category, the sensitivity and specificity were 95.40% and 92.70%, respectively. The interobserver agreement of the testing dataset of the model was moderate to substantial with the senior endoscopist. The accuracy of the determination of endoscopic therapy required and high-risk endoscopic therapy of the deep learning model was higher than that of the novice endoscopist. Conclusions In this study, the deep learning model performed better than inexperienced endoscopists. Further improvement of the model may aid in clinical decision-making during clinical practice, especially for trainee endoscopist.
引用
收藏
页码:504 / 513
页数:10
相关论文
共 50 条
  • [31] A COMPARISON OF THE MENTAL STATUS, PERSONALITY PROFILES AND LIFE EVENTS OF PATIENTS WITH IRRITABLE-BOWEL-SYNDROME AND PEPTIC-ULCER DISEASE
    DINAN, TG
    OKEANE, V
    OBOYLE, C
    CHUA, A
    KEELING, PWN
    ACTA PSYCHIATRICA SCANDINAVICA, 1991, 84 (01) : 26 - 28
  • [32] Performance Comparison of Deep Learning Models for Damage Identification of Aging Bridges
    Chung, Su-Wan
    Hong, Sung-Sam
    Kim, Byung-Kon
    APPLIED SCIENCES-BASEL, 2023, 13 (24):
  • [33] Development of deep learning segmentation models for coronary X-ray angiography: Quality assessment by a new global segmentation score and comparison with human performance
    Menezes, Miguel Nobre
    Lourenco-Silva, Joao
    Silva, Beatriz
    Rodrigues, Oliveira
    Francisco, Ana Rita G.
    Ferreira, Pedro Carrilho
    Oliveira, Arlindo L.
    Pinto, Fausto J.
    REVISTA PORTUGUESA DE CARDIOLOGIA, 2022, 41 (12) : 1011 - 1021
  • [34] Artificial intelligence-based analysis of whole-body bone scintigraphy: The quest for the optimal deep learning algorithm and comparison with human observer performance
    Hajianfar, Ghasem
    Sabouri, Maziar
    Salimi, Yazdan
    Amini, Mehdi
    Bagheri, Soroush
    Jenabi, Elnaz
    Hekmat, Sepideh
    Maghsudi, Mehdi
    Mansouri, Zahra
    Khateri, Maziar
    Jamshidi, Mohammad Hosein
    Jafari, Esmail
    Rajabi, Ahmad Bitarafan
    Assadi, Majid
    Oveisi, Mehrdad
    Shiri, Isaac
    Zaidi, Habib
    ZEITSCHRIFT FUR MEDIZINISCHE PHYSIK, 2024, 34 (02): : 242 - 257
  • [35] Performance Comparison of Pretrained Deep Learning Models for Landfill Waste Classification
    Younis, Hussein
    Obaid, Mahmoud
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (11) : 689 - 698
  • [36] A Comprehensive Study of Deep Learning and Performance Comparison of Deep Neural Network Models (YOLO, RetinaNet)
    Nife, Nadia Ibrahim
    Chtourou, Mohamed
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2023, 19 (12) : 62 - 77
  • [37] Comparison of Deep Transfer Learning Techniques in Human Skin Burns Discrimination
    Abubakar, Aliyu
    Ajuji, Mohammed
    Yahya, Ibrahim Usman
    APPLIED SYSTEM INNOVATION, 2020, 3 (02) : 1 - 15
  • [38] Machine Learning and Deep Learning Models for Diagnosis of Parkinson's Disease: A Performance Analysis
    Mounika, P.
    Rao, S. Govinda
    PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 381 - 388
  • [39] Performance Comparison of Machine Learning and Deep Learning While Classifying Driver's Cognitive State
    Bhardwaj, Rahul
    Parameswaran, Swathy
    Balasubramanian, Venkatesh
    2018 IEEE 13TH INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (IEEE ICIIS), 2018, : 102 - 106
  • [40] Performance analysis of deep learning architectures for plant leaves disease detection
    Dahiya S.
    Gulati T.
    Gupta D.
    Measurement: Sensors, 2022, 24