Performance Comparison of the Deep Learning and the Human Endoscopist for Bleeding Peptic Ulcer Disease

被引：27

作者：

Yen, Hsu-Heng ^{[1
,2
,3
]}

Wu, Ping-Yu ^{[3
]}

Su, Pei-Yuan ^{[1
]}

Yang, Chia-Wei ^{[1
]}

Chen, Yang-Yuan ^{[1
]}

Chen, Mei-Fen ^{[3
,5
]}

Lin, Wen-Chen ^{[5
]}

Tsai, Cheng-Lun ^{[4
,5
]}

Lin, Kang-Ping ^{[3
,5
]}

机构：

[1] Changhua Christian Hosp, Div Gastroenterol, Dept Internal Med, Changhua, Taiwan

[2] Chien Kuo Technol Univ, Gen Educ Ctr, Changhua, Taiwan

[3] Chung Yuan Christian Univ, Dept Elect Engn, Taoyuan, Taiwan

[4] Chung Yuan Christian Univ, Dept Biomed Engn, Taoyuan, Taiwan

[5] Chung Yuan Christian Univ, Technol Translat Ctr Med Device, 200 Chung Pei Rd, Taoyuan 32023, Taiwan

来源：

JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING | 2021年 / 41卷 / 04期

关键词：

Peptic ulcer; Bleeding; Deep learning; Artificial intelligence;

D O I：

10.1007/s40846-021-00608-0

中图分类号：

R318 [生物医学工程];

学科分类号：

0831 ;

摘要：

Purpose Management of peptic ulcer bleeding is clinically challenging. Accurate characterization of the bleeding during endoscopy is key for endoscopic therapy. This study aimed to assess whether a deep learning model can aid in the classification of bleeding peptic ulcer disease. Methods Endoscopic still images of patients (n = 1694) with peptic ulcer bleeding for the last 5 years were retrieved and reviewed. Overall, 2289 images were collected for deep learning model training, and 449 images were validated for the performance test. Two expert endoscopists classified the images into different classes based on their appearance. Four deep learning models, including Mobile Net V2, VGG16, Inception V4, and ResNet50, were proposed and pre-trained by ImageNet with the established convolutional neural network algorithm. A comparison of the endoscopists and trained deep learning model was performed to evaluate the model's performance on a dataset of 449 testing images. Results The results first presented the performance comparisons of four deep learning models. The Mobile Net V2 presented the optimal performance of the proposal models. The Mobile Net V2 was chosen for further comparing the performance with the diagnostic results obtained by one senior and one novice endoscopists. The sensitivity and specificity were acceptable for the prediction of "normal" lesions in both 3-class and 4-class classifications. For the 3-class category, the sensitivity and specificity were 94.83% and 92.36%, respectively. For the 4-class category, the sensitivity and specificity were 95.40% and 92.70%, respectively. The interobserver agreement of the testing dataset of the model was moderate to substantial with the senior endoscopist. The accuracy of the determination of endoscopic therapy required and high-risk endoscopic therapy of the deep learning model was higher than that of the novice endoscopist. Conclusions In this study, the deep learning model performed better than inexperienced endoscopists. Further improvement of the model may aid in clinical decision-making during clinical practice, especially for trainee endoscopist.

引用

页码：504 / 513

页数：10

共 50 条

[21] Performance Comparison of Deep and Machine Learning Approaches Toward COVID-19 Detection
Yahyaoui, Amani
Rasheed, Jawad
Alsubai, Shtwai
Shubair, Raed M.
Alqahtani, Abdullah
Isler, Buket
Haider, Rana Zeeshan
INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 37 (02) : 2247 - 2261
[22] Real Time Performance Comparison of Multi-class Deep Learning Methods at the Edge
Sonmez, Doruk
Cetin, Aydin
ARTIFICIAL INTELLIGENCE AND APPLIED MATHEMATICS IN ENGINEERING PROBLEMS, 2020, 43 : 853 - 867
[23] Monthly streamflow prediction and performance comparison of machine learning and deep learning methods
Ayana, Omer
Kanbak, Deniz Furkan
Keles, Muemine Kaya
Turhan, Evren
ACTA GEOPHYSICA, 2023, 71 (06) : 2905 - 2922
[24] Monthly streamflow prediction and performance comparison of machine learning and deep learning methods
Ömer Ayana
Deniz Furkan Kanbak
Mümine Kaya Keleş
Evren Turhan
Acta Geophysica, 2023, 71 : 2905 - 2922
[25] Performance Analysis and CPU vs GPU Comparison for Deep Learning
Buber, Ebubekir
Diri, Banu
2018 6TH INTERNATIONAL CONFERENCE ON CONTROL ENGINEERING & INFORMATION TECHNOLOGY (CEIT), 2018,
[26] Machine Learning and Deep Learning for Crop Disease Diagnosis: Performance Analysis and Review
Ngugi, Habiba Njeri
Akinyelu, Andronicus A.
Ezugwu, Absalom E.
AGRONOMY-BASEL, 2024, 14 (12):
[27] Detection of Monkeypox Disease from Human Skin Images with a Hybrid Deep Learning Model
Uysal, Fatih
DIAGNOSTICS, 2023, 13 (10)
[28] Generalization Performance of Deep Learning Models in Neurodegenerative Disease Classification
Yagis, Ekin
De Herrera, Alba G. Seco
Citi, Luca
2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 1692 - 1698
[29] Performance benchmark of deep learning human pose estimation for UAVs
Kalampokas, Theofanis
Krinidis, Stelios
Chatzis, Vassilios
Papakostas, George A.
MACHINE VISION AND APPLICATIONS, 2023, 34 (06)
[30] Performance benchmark of deep learning human pose estimation for UAVs
Theofanis Kalampokas
Stelios Krinidis
Vassilios Chatzis
George A. Papakostas
Machine Vision and Applications, 2023, 34

← 1 2 3 4 5 →