Performance Comparison of the Deep Learning and the Human Endoscopist for Bleeding Peptic Ulcer Disease

被引:27
|
作者
Yen, Hsu-Heng [1 ,2 ,3 ]
Wu, Ping-Yu [3 ]
Su, Pei-Yuan [1 ]
Yang, Chia-Wei [1 ]
Chen, Yang-Yuan [1 ]
Chen, Mei-Fen [3 ,5 ]
Lin, Wen-Chen [5 ]
Tsai, Cheng-Lun [4 ,5 ]
Lin, Kang-Ping [3 ,5 ]
机构
[1] Changhua Christian Hosp, Div Gastroenterol, Dept Internal Med, Changhua, Taiwan
[2] Chien Kuo Technol Univ, Gen Educ Ctr, Changhua, Taiwan
[3] Chung Yuan Christian Univ, Dept Elect Engn, Taoyuan, Taiwan
[4] Chung Yuan Christian Univ, Dept Biomed Engn, Taoyuan, Taiwan
[5] Chung Yuan Christian Univ, Technol Translat Ctr Med Device, 200 Chung Pei Rd, Taoyuan 32023, Taiwan
关键词
Peptic ulcer; Bleeding; Deep learning; Artificial intelligence;
D O I
10.1007/s40846-021-00608-0
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Purpose Management of peptic ulcer bleeding is clinically challenging. Accurate characterization of the bleeding during endoscopy is key for endoscopic therapy. This study aimed to assess whether a deep learning model can aid in the classification of bleeding peptic ulcer disease. Methods Endoscopic still images of patients (n = 1694) with peptic ulcer bleeding for the last 5 years were retrieved and reviewed. Overall, 2289 images were collected for deep learning model training, and 449 images were validated for the performance test. Two expert endoscopists classified the images into different classes based on their appearance. Four deep learning models, including Mobile Net V2, VGG16, Inception V4, and ResNet50, were proposed and pre-trained by ImageNet with the established convolutional neural network algorithm. A comparison of the endoscopists and trained deep learning model was performed to evaluate the model's performance on a dataset of 449 testing images. Results The results first presented the performance comparisons of four deep learning models. The Mobile Net V2 presented the optimal performance of the proposal models. The Mobile Net V2 was chosen for further comparing the performance with the diagnostic results obtained by one senior and one novice endoscopists. The sensitivity and specificity were acceptable for the prediction of "normal" lesions in both 3-class and 4-class classifications. For the 3-class category, the sensitivity and specificity were 94.83% and 92.36%, respectively. For the 4-class category, the sensitivity and specificity were 95.40% and 92.70%, respectively. The interobserver agreement of the testing dataset of the model was moderate to substantial with the senior endoscopist. The accuracy of the determination of endoscopic therapy required and high-risk endoscopic therapy of the deep learning model was higher than that of the novice endoscopist. Conclusions In this study, the deep learning model performed better than inexperienced endoscopists. Further improvement of the model may aid in clinical decision-making during clinical practice, especially for trainee endoscopist.
引用
收藏
页码:504 / 513
页数:10
相关论文
共 50 条
  • [41] Predicting Human Performance in Vertical Menu Selection Using Deep Learning
    Li, Yang
    Bengio, Samy
    Bailly, Gilles
    PROCEEDINGS OF THE 2018 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI 2018), 2018,
  • [42] Scalogram based performance comparison of deep learning architectures for dysarthric speech detection
    Shabber, Shaik Mulla
    Sumesh, E. P.
    Ramachandran, Vidhya Lavanya
    ARTIFICIAL INTELLIGENCE REVIEW, 2025, 58 (05)
  • [43] Performance Comparison of Different EEG Analysis Techniques Based on Deep Learning Approaches
    Belsare, Swarali
    Kale, Maitreyi
    Ghayal, Priya
    Gogate, Aishwarya
    Itkar, Suhasini
    2021 INTERNATIONAL CONFERENCE ON EMERGING SMART COMPUTING AND INFORMATICS (ESCI), 2021, : 490 - 493
  • [44] Comparison Performance on SOTA Deep Learning Models for Coffee Beans Grading Inspection
    Firdaus, Achmad Norman
    Rakhmawati, Amalia
    Nadhira, Vebi
    Suprijanto
    Juliastuti, Endang
    Utomo, Gema Nuran
    Risangtuni, Ayu Gareta
    2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 100 - 105
  • [45] Freezing of Gait detection in Parkinson's disease: comparison of deep learning frameworks
    Andrei, Alexandra-Georgiana
    Tautan, Alexandra-Maria
    Ionescu, Bogdan
    2024 IEEE INTERNATIONAL SYMPOSIUM ON MEDICAL MEASUREMENTS AND APPLICATIONS, MEMEA 2024, 2024,
  • [46] Comparison of the Deep Learning Performance for Short-Term Power Load Forecasting
    Son, Namrye
    SUSTAINABILITY, 2021, 13 (22)
  • [47] Performance evaluation of plant leaf disease detection using deep learning models
    Singh, Gulbir
    Yogi, Kuldeep Kumar
    ARCHIVES OF PHYTOPATHOLOGY AND PLANT PROTECTION, 2023, 56 (03) : 209 - 233
  • [48] A Performance Based Study on Deep Learning Algorithms in the Efficient Prediction of Heart Disease
    Islam, Rakibul
    Beeravolu, Abhijit Reddy
    Islam, Md Al Habib
    Karim, Asif
    Azam, Sami
    Mukti, Sanzida Akter
    2ND INTERNATIONAL INFORMATICS AND SOFTWARE ENGINEERING CONFERENCE (IISEC), 2021,
  • [49] Performance of a deep learning based neural network in the selection of human blastocysts for implantation
    Bormann, Charles L.
    Kanakasabapathy, Manoj Kumar
    Thirumalaraju, Prudhvi
    Gupta, Raghav
    Pooniwala, Rohan
    Kandula, Hemanth
    Hariton, Eduardo
    Souter, Irene
    Dimitriadis, Irene
    Ramirez, Leslie B.
    Curchoe, Carol L.
    Swain, Jason
    Boehnlein, Lynn M.
    Shafiee, Hadi
    ELIFE, 2020, 9
  • [50] COMPARISON OF DEEP LEARNING MODEL PERFORMANCE BETWEEN META-DATASET TRAINING VERSUS DEEP NEURAL ENSEMBLES
    Hurt, J. Alex
    Scott, Grant J.
    Davis, Curt H.
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 1326 - 1329