An automated detection system for colonoscopy images using a dual encoder-decoder model

被引:7
作者
Hwang, Maxwell [1 ,2 ,3 ]
Wang, Da [1 ,2 ,3 ]
Kong, Xiang-Xing [1 ,2 ,3 ]
Wang, Zhanhuai [1 ,2 ,3 ]
Li, Jun [1 ,2 ,3 ]
Jiang, Wei-Cheng [4 ]
Hwang, Kao-Shing [5 ]
Ding, Kefeng [1 ,2 ,3 ]
机构
[1] Zhejiang Univ, Dept Colorectal Surg, Affiliated Hosp 2, Sch Med, Hangzhou, Peoples R China
[2] China Natl Minist Educ, Key Lab Mol Biol Med Sci, Key Lab Canc Prevent & Intervent, Canc Inst, Hangzhou, Zhejiang, Peoples R China
[3] Zhejiang Univ, Sch Med, Affiliated Hosp 2, Hangzhou, Peoples R China
[4] Tunghai Univ, Dept Elect Engn, Taichung, Taiwan
[5] Natl Sun Yat Sen Univ, Dept Elect Engn, Kaohsiung, Taiwan
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Colorectal cancer; Computer-aided detection; Deep learning; Polyp detection; Convolutional neural network; POLYPS;
D O I
10.1016/j.compmedimag.2020.101763
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Conventional computer-aided detection systems (CADs) for colonoscopic images utilize shape, texture, or temporal information to detect polyps, so they have limited sensitivity and specificity. This study proposes a method to extract possible polyp features automatically using convolutional neural networks (CNNs). The objective of this work aims at building up a light-weight dual encoder-decoder model structure for polyp detection in colonoscopy Images. This proposed model, though with a relatively shallow structure, is expected to have the capability of a similar performance to the methods with much deeper structures. The proposed CAD model consists of two sequential encoder-decoder networks that consist of several CNN layers and full connection layers. The front end of the model is a hetero-associator (also known as hetero-encoder) that uses backpropagation learning to generate a set of reliably corrupted labeled images with a certain degree of similarity to a ground truth image, which eliminates the need for a large amount of training data that is usually required for medical images tasks. This dual CNN architecture generates a set of noisy images that are similar to the labeled data to train its counterpart, the auto-associator (also known as auto-encoder), in order to increase the successor's discriminative power in classification. The auto-encoder is also equipped with CNNs to simultaneously capture the features of the labeled images that contain noise. The proposed method uses features that are learned from open medical datasets and the dataset of Zhejiang University (ZJU), which contains around one thousand images. The performance of the proposed architecture is compared with a state-of-the-art detection model in terms of the metrics of the Jaccard index, the DICE similarity score, and two other geometric measures. The improvements in the performance of the proposed model are attributed to the effective reduction in false positives in the auto-encoder and the generation of noisy candidate images by the hetero-encoder. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:9
相关论文
共 50 条
[41]   A multitask encoder-decoder model for quality prediction in injection moulding [J].
Muaz, Muhammad ;
Yu, Hanxin ;
Sung, Wai Lam ;
Liu, Chang ;
Drescher, Benny .
JOURNAL OF MANUFACTURING PROCESSES, 2023, 103 :238-247
[42]   Hybrid Encoder-Decoder Model for Retinal Blood Vessels Segmentation [J].
Sule, Olubunmi Omobola .
PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR 2021), 2022, 417 :524-534
[43]   HeteroNet: a heterogeneous encoder-decoder network for sea-land segmentation of remote sensing images [J].
Ji, Xun ;
Tang, Longbin ;
Liu, Tianhe ;
Guo, Hui .
JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (05)
[44]   A novel deep convolutional encoder-decoder network: application to moving object detection in videos [J].
Ganivada, Avatharam ;
Yara, Srinivas .
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (29) :22027-22041
[45]   Quantum Mayfly Optimization with Encoder-Decoder Driven LSTM Networks for Malware Detection and Classification Model [J].
Omar A. Alzubi ;
Jafar A. Alzubi ;
Tareq Mahmod Alzubi ;
Ashish Singh .
Mobile Networks and Applications, 2023, 28 :795-807
[46]   VISIBLE AND INFRARED IMAGE FUSION USING ENCODER-DECODER NETWORK [J].
Ataman, Ferhat Can ;
Bozdagi Akar, Gozde .
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, :1779-1783
[47]   Encoder-decoder based convolutional neural networks for image forgery detection [J].
El Biach, Fatima Zahra ;
Iala, Imad ;
Laanaya, Hicham ;
Minaoui, Khalid .
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (16) :22611-22628
[48]   Quantum Mayfly Optimization with Encoder-Decoder Driven LSTM Networks for Malware Detection and Classification Model [J].
Alzubi, Omar A. ;
Alzubi, Jafar A. ;
Alzubi, Tareq Mahmod ;
Singh, Ashish .
MOBILE NETWORKS & APPLICATIONS, 2023, 28 (02) :795-807
[49]   SAS-UNet: Modified encoder-decoder network for the segmentation of obscenity in images [J].
Samal, Sonali ;
Gadekellu, Thippa Reddy ;
Rajput, Pankaj ;
Zhang, Yu-Dong ;
Balabantaray, Bunil Kumar .
2023 IEEE/ACM 23RD INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING WORKSHOPS, CCGRIDW, 2023, :45-51
[50]   Inferring contextual preferences using deep encoder-decoder learners [J].
Unger, Moshe ;
Shapira, Bracha ;
Rokach, Lior ;
Livne, Amit .
NEW REVIEW OF HYPERMEDIA AND MULTIMEDIA, 2018, 24 (03) :262-290