Performance Comparison of the Deep Learning and the Human Endoscopist for Bleeding Peptic Ulcer Disease

被引：27

作者：

Yen, Hsu-Heng ^{[1
,2
,3
]}

Wu, Ping-Yu ^{[3
]}

Su, Pei-Yuan ^{[1
]}

Yang, Chia-Wei ^{[1
]}

Chen, Yang-Yuan ^{[1
]}

Chen, Mei-Fen ^{[3
,5
]}

Lin, Wen-Chen ^{[5
]}

Tsai, Cheng-Lun ^{[4
,5
]}

Lin, Kang-Ping ^{[3
,5
]}

机构：

[1] Changhua Christian Hosp, Div Gastroenterol, Dept Internal Med, Changhua, Taiwan

[2] Chien Kuo Technol Univ, Gen Educ Ctr, Changhua, Taiwan

[3] Chung Yuan Christian Univ, Dept Elect Engn, Taoyuan, Taiwan

[4] Chung Yuan Christian Univ, Dept Biomed Engn, Taoyuan, Taiwan

[5] Chung Yuan Christian Univ, Technol Translat Ctr Med Device, 200 Chung Pei Rd, Taoyuan 32023, Taiwan

来源：

JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING | 2021年 / 41卷 / 04期

关键词：

Peptic ulcer; Bleeding; Deep learning; Artificial intelligence;

D O I：

10.1007/s40846-021-00608-0

中图分类号：

R318 [生物医学工程];

学科分类号：

0831 ;

摘要：

Purpose Management of peptic ulcer bleeding is clinically challenging. Accurate characterization of the bleeding during endoscopy is key for endoscopic therapy. This study aimed to assess whether a deep learning model can aid in the classification of bleeding peptic ulcer disease. Methods Endoscopic still images of patients (n = 1694) with peptic ulcer bleeding for the last 5 years were retrieved and reviewed. Overall, 2289 images were collected for deep learning model training, and 449 images were validated for the performance test. Two expert endoscopists classified the images into different classes based on their appearance. Four deep learning models, including Mobile Net V2, VGG16, Inception V4, and ResNet50, were proposed and pre-trained by ImageNet with the established convolutional neural network algorithm. A comparison of the endoscopists and trained deep learning model was performed to evaluate the model's performance on a dataset of 449 testing images. Results The results first presented the performance comparisons of four deep learning models. The Mobile Net V2 presented the optimal performance of the proposal models. The Mobile Net V2 was chosen for further comparing the performance with the diagnostic results obtained by one senior and one novice endoscopists. The sensitivity and specificity were acceptable for the prediction of "normal" lesions in both 3-class and 4-class classifications. For the 3-class category, the sensitivity and specificity were 94.83% and 92.36%, respectively. For the 4-class category, the sensitivity and specificity were 95.40% and 92.70%, respectively. The interobserver agreement of the testing dataset of the model was moderate to substantial with the senior endoscopist. The accuracy of the determination of endoscopic therapy required and high-risk endoscopic therapy of the deep learning model was higher than that of the novice endoscopist. Conclusions In this study, the deep learning model performed better than inexperienced endoscopists. Further improvement of the model may aid in clinical decision-making during clinical practice, especially for trainee endoscopist.

引用

页码：504 / 513

页数：10

共 50 条

[31] A COMPARISON OF THE MENTAL STATUS, PERSONALITY PROFILES AND LIFE EVENTS OF PATIENTS WITH IRRITABLE-BOWEL-SYNDROME AND PEPTIC-ULCER DISEASE
DINAN, TG
OKEANE, V
OBOYLE, C
CHUA, A
KEELING, PWN
ACTA PSYCHIATRICA SCANDINAVICA, 1991, 84 (01) : 26 - 28
[32] Performance Comparison of Deep Learning Models for Damage Identification of Aging Bridges
Chung, Su-Wan
Hong, Sung-Sam
Kim, Byung-Kon
APPLIED SCIENCES-BASEL, 2023, 13 (24):
[33] Development of deep learning segmentation models for coronary X-ray angiography: Quality assessment by a new global segmentation score and comparison with human performance
Menezes, Miguel Nobre
Lourenco-Silva, Joao
Silva, Beatriz
Rodrigues, Oliveira
Francisco, Ana Rita G.
Ferreira, Pedro Carrilho
Oliveira, Arlindo L.
Pinto, Fausto J.
REVISTA PORTUGUESA DE CARDIOLOGIA, 2022, 41 (12) : 1011 - 1021
[34] Artificial intelligence-based analysis of whole-body bone scintigraphy: The quest for the optimal deep learning algorithm and comparison with human observer performance
Hajianfar, Ghasem
Sabouri, Maziar
Salimi, Yazdan
Amini, Mehdi
Bagheri, Soroush
Jenabi, Elnaz
Hekmat, Sepideh
Maghsudi, Mehdi
Mansouri, Zahra
Khateri, Maziar
Jamshidi, Mohammad Hosein
Jafari, Esmail
Rajabi, Ahmad Bitarafan
Assadi, Majid
Oveisi, Mehrdad
Shiri, Isaac
Zaidi, Habib
ZEITSCHRIFT FUR MEDIZINISCHE PHYSIK, 2024, 34 (02): : 242 - 257
[35] Performance Comparison of Pretrained Deep Learning Models for Landfill Waste Classification
Younis, Hussein
Obaid, Mahmoud
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (11) : 689 - 698
[36] A Comprehensive Study of Deep Learning and Performance Comparison of Deep Neural Network Models (YOLO, RetinaNet)
Nife, Nadia Ibrahim
Chtourou, Mohamed
INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2023, 19 (12) : 62 - 77
[37] Comparison of Deep Transfer Learning Techniques in Human Skin Burns Discrimination
Abubakar, Aliyu
Ajuji, Mohammed
Yahya, Ibrahim Usman
APPLIED SYSTEM INNOVATION, 2020, 3 (02) : 1 - 15
[38] Machine Learning and Deep Learning Models for Diagnosis of Parkinson's Disease: A Performance Analysis
Mounika, P.
Rao, S. Govinda
PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 381 - 388
[39] Performance Comparison of Machine Learning and Deep Learning While Classifying Driver's Cognitive State
Bhardwaj, Rahul
Parameswaran, Swathy
Balasubramanian, Venkatesh
2018 IEEE 13TH INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (IEEE ICIIS), 2018, : 102 - 106
[40] Performance analysis of deep learning architectures for plant leaves disease detection
Dahiya S.
Gulati T.
Gupta D.
Measurement: Sensors, 2022, 24

← 1 2 3 4 5 →