Performance Comparison of the Deep Learning and the Human Endoscopist for Bleeding Peptic Ulcer Disease

被引:27
|
作者
Yen, Hsu-Heng [1 ,2 ,3 ]
Wu, Ping-Yu [3 ]
Su, Pei-Yuan [1 ]
Yang, Chia-Wei [1 ]
Chen, Yang-Yuan [1 ]
Chen, Mei-Fen [3 ,5 ]
Lin, Wen-Chen [5 ]
Tsai, Cheng-Lun [4 ,5 ]
Lin, Kang-Ping [3 ,5 ]
机构
[1] Changhua Christian Hosp, Div Gastroenterol, Dept Internal Med, Changhua, Taiwan
[2] Chien Kuo Technol Univ, Gen Educ Ctr, Changhua, Taiwan
[3] Chung Yuan Christian Univ, Dept Elect Engn, Taoyuan, Taiwan
[4] Chung Yuan Christian Univ, Dept Biomed Engn, Taoyuan, Taiwan
[5] Chung Yuan Christian Univ, Technol Translat Ctr Med Device, 200 Chung Pei Rd, Taoyuan 32023, Taiwan
关键词
Peptic ulcer; Bleeding; Deep learning; Artificial intelligence;
D O I
10.1007/s40846-021-00608-0
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Purpose Management of peptic ulcer bleeding is clinically challenging. Accurate characterization of the bleeding during endoscopy is key for endoscopic therapy. This study aimed to assess whether a deep learning model can aid in the classification of bleeding peptic ulcer disease. Methods Endoscopic still images of patients (n = 1694) with peptic ulcer bleeding for the last 5 years were retrieved and reviewed. Overall, 2289 images were collected for deep learning model training, and 449 images were validated for the performance test. Two expert endoscopists classified the images into different classes based on their appearance. Four deep learning models, including Mobile Net V2, VGG16, Inception V4, and ResNet50, were proposed and pre-trained by ImageNet with the established convolutional neural network algorithm. A comparison of the endoscopists and trained deep learning model was performed to evaluate the model's performance on a dataset of 449 testing images. Results The results first presented the performance comparisons of four deep learning models. The Mobile Net V2 presented the optimal performance of the proposal models. The Mobile Net V2 was chosen for further comparing the performance with the diagnostic results obtained by one senior and one novice endoscopists. The sensitivity and specificity were acceptable for the prediction of "normal" lesions in both 3-class and 4-class classifications. For the 3-class category, the sensitivity and specificity were 94.83% and 92.36%, respectively. For the 4-class category, the sensitivity and specificity were 95.40% and 92.70%, respectively. The interobserver agreement of the testing dataset of the model was moderate to substantial with the senior endoscopist. The accuracy of the determination of endoscopic therapy required and high-risk endoscopic therapy of the deep learning model was higher than that of the novice endoscopist. Conclusions In this study, the deep learning model performed better than inexperienced endoscopists. Further improvement of the model may aid in clinical decision-making during clinical practice, especially for trainee endoscopist.
引用
收藏
页码:504 / 513
页数:10
相关论文
共 50 条
  • [21] Performance Comparison of Deep and Machine Learning Approaches Toward COVID-19 Detection
    Yahyaoui, Amani
    Rasheed, Jawad
    Alsubai, Shtwai
    Shubair, Raed M.
    Alqahtani, Abdullah
    Isler, Buket
    Haider, Rana Zeeshan
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 37 (02) : 2247 - 2261
  • [22] Real Time Performance Comparison of Multi-class Deep Learning Methods at the Edge
    Sonmez, Doruk
    Cetin, Aydin
    ARTIFICIAL INTELLIGENCE AND APPLIED MATHEMATICS IN ENGINEERING PROBLEMS, 2020, 43 : 853 - 867
  • [23] Monthly streamflow prediction and performance comparison of machine learning and deep learning methods
    Ayana, Omer
    Kanbak, Deniz Furkan
    Keles, Muemine Kaya
    Turhan, Evren
    ACTA GEOPHYSICA, 2023, 71 (06) : 2905 - 2922
  • [24] Monthly streamflow prediction and performance comparison of machine learning and deep learning methods
    Ömer Ayana
    Deniz Furkan Kanbak
    Mümine Kaya Keleş
    Evren Turhan
    Acta Geophysica, 2023, 71 : 2905 - 2922
  • [25] Performance Analysis and CPU vs GPU Comparison for Deep Learning
    Buber, Ebubekir
    Diri, Banu
    2018 6TH INTERNATIONAL CONFERENCE ON CONTROL ENGINEERING & INFORMATION TECHNOLOGY (CEIT), 2018,
  • [26] Machine Learning and Deep Learning for Crop Disease Diagnosis: Performance Analysis and Review
    Ngugi, Habiba Njeri
    Akinyelu, Andronicus A.
    Ezugwu, Absalom E.
    AGRONOMY-BASEL, 2024, 14 (12):
  • [28] Generalization Performance of Deep Learning Models in Neurodegenerative Disease Classification
    Yagis, Ekin
    De Herrera, Alba G. Seco
    Citi, Luca
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 1692 - 1698
  • [29] Performance benchmark of deep learning human pose estimation for UAVs
    Kalampokas, Theofanis
    Krinidis, Stelios
    Chatzis, Vassilios
    Papakostas, George A.
    MACHINE VISION AND APPLICATIONS, 2023, 34 (06)
  • [30] Performance benchmark of deep learning human pose estimation for UAVs
    Theofanis Kalampokas
    Stelios Krinidis
    Vassilios Chatzis
    George A. Papakostas
    Machine Vision and Applications, 2023, 34