Performance Comparison of the Deep Learning and the Human Endoscopist for Bleeding Peptic Ulcer Disease

被引：27

作者：

Yen, Hsu-Heng ^{[1
,2
,3
]}

Wu, Ping-Yu ^{[3
]}

Su, Pei-Yuan ^{[1
]}

Yang, Chia-Wei ^{[1
]}

Chen, Yang-Yuan ^{[1
]}

Chen, Mei-Fen ^{[3
,5
]}

Lin, Wen-Chen ^{[5
]}

Tsai, Cheng-Lun ^{[4
,5
]}

Lin, Kang-Ping ^{[3
,5
]}

机构：

[1] Changhua Christian Hosp, Div Gastroenterol, Dept Internal Med, Changhua, Taiwan

[2] Chien Kuo Technol Univ, Gen Educ Ctr, Changhua, Taiwan

[3] Chung Yuan Christian Univ, Dept Elect Engn, Taoyuan, Taiwan

[4] Chung Yuan Christian Univ, Dept Biomed Engn, Taoyuan, Taiwan

[5] Chung Yuan Christian Univ, Technol Translat Ctr Med Device, 200 Chung Pei Rd, Taoyuan 32023, Taiwan

来源：

JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING | 2021年 / 41卷 / 04期

关键词：

Peptic ulcer; Bleeding; Deep learning; Artificial intelligence;

D O I：

10.1007/s40846-021-00608-0

中图分类号：

R318 [生物医学工程];

学科分类号：

0831 ;

摘要：

Purpose Management of peptic ulcer bleeding is clinically challenging. Accurate characterization of the bleeding during endoscopy is key for endoscopic therapy. This study aimed to assess whether a deep learning model can aid in the classification of bleeding peptic ulcer disease. Methods Endoscopic still images of patients (n = 1694) with peptic ulcer bleeding for the last 5 years were retrieved and reviewed. Overall, 2289 images were collected for deep learning model training, and 449 images were validated for the performance test. Two expert endoscopists classified the images into different classes based on their appearance. Four deep learning models, including Mobile Net V2, VGG16, Inception V4, and ResNet50, were proposed and pre-trained by ImageNet with the established convolutional neural network algorithm. A comparison of the endoscopists and trained deep learning model was performed to evaluate the model's performance on a dataset of 449 testing images. Results The results first presented the performance comparisons of four deep learning models. The Mobile Net V2 presented the optimal performance of the proposal models. The Mobile Net V2 was chosen for further comparing the performance with the diagnostic results obtained by one senior and one novice endoscopists. The sensitivity and specificity were acceptable for the prediction of "normal" lesions in both 3-class and 4-class classifications. For the 3-class category, the sensitivity and specificity were 94.83% and 92.36%, respectively. For the 4-class category, the sensitivity and specificity were 95.40% and 92.70%, respectively. The interobserver agreement of the testing dataset of the model was moderate to substantial with the senior endoscopist. The accuracy of the determination of endoscopic therapy required and high-risk endoscopic therapy of the deep learning model was higher than that of the novice endoscopist. Conclusions In this study, the deep learning model performed better than inexperienced endoscopists. Further improvement of the model may aid in clinical decision-making during clinical practice, especially for trainee endoscopist.

引用

页码：504 / 513

页数：10

共 50 条

[41] Predicting Human Performance in Vertical Menu Selection Using Deep Learning
Li, Yang
Bengio, Samy
Bailly, Gilles
PROCEEDINGS OF THE 2018 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI 2018), 2018,
[42] Scalogram based performance comparison of deep learning architectures for dysarthric speech detection
Shabber, Shaik Mulla
Sumesh, E. P.
Ramachandran, Vidhya Lavanya
ARTIFICIAL INTELLIGENCE REVIEW, 2025, 58 (05)
[43] Performance Comparison of Different EEG Analysis Techniques Based on Deep Learning Approaches
Belsare, Swarali
Kale, Maitreyi
Ghayal, Priya
Gogate, Aishwarya
Itkar, Suhasini
2021 INTERNATIONAL CONFERENCE ON EMERGING SMART COMPUTING AND INFORMATICS (ESCI), 2021, : 490 - 493
[44] Comparison Performance on SOTA Deep Learning Models for Coffee Beans Grading Inspection
Firdaus, Achmad Norman
Rakhmawati, Amalia
Nadhira, Vebi
Suprijanto
Juliastuti, Endang
Utomo, Gema Nuran
Risangtuni, Ayu Gareta
2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 100 - 105
[45] Freezing of Gait detection in Parkinson's disease: comparison of deep learning frameworks
Andrei, Alexandra-Georgiana
Tautan, Alexandra-Maria
Ionescu, Bogdan
2024 IEEE INTERNATIONAL SYMPOSIUM ON MEDICAL MEASUREMENTS AND APPLICATIONS, MEMEA 2024, 2024,
[46] Comparison of the Deep Learning Performance for Short-Term Power Load Forecasting
Son, Namrye
SUSTAINABILITY, 2021, 13 (22)
[47] Performance evaluation of plant leaf disease detection using deep learning models
Singh, Gulbir
Yogi, Kuldeep Kumar
ARCHIVES OF PHYTOPATHOLOGY AND PLANT PROTECTION, 2023, 56 (03) : 209 - 233
[48] A Performance Based Study on Deep Learning Algorithms in the Efficient Prediction of Heart Disease
Islam, Rakibul
Beeravolu, Abhijit Reddy
Islam, Md Al Habib
Karim, Asif
Azam, Sami
Mukti, Sanzida Akter
2ND INTERNATIONAL INFORMATICS AND SOFTWARE ENGINEERING CONFERENCE (IISEC), 2021,
[49] Performance of a deep learning based neural network in the selection of human blastocysts for implantation
Bormann, Charles L.
Kanakasabapathy, Manoj Kumar
Thirumalaraju, Prudhvi
Gupta, Raghav
Pooniwala, Rohan
Kandula, Hemanth
Hariton, Eduardo
Souter, Irene
Dimitriadis, Irene
Ramirez, Leslie B.
Curchoe, Carol L.
Swain, Jason
Boehnlein, Lynn M.
Shafiee, Hadi
ELIFE, 2020, 9
[50] COMPARISON OF DEEP LEARNING MODEL PERFORMANCE BETWEEN META-DATASET TRAINING VERSUS DEEP NEURAL ENSEMBLES
Hurt, J. Alex
Scott, Grant J.
Davis, Curt H.
2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 1326 - 1329

← 1 2 3 4 5 →