An automated detection system for colonoscopy images using a dual encoder-decoder model

被引:7
作者
Hwang, Maxwell [1 ,2 ,3 ]
Wang, Da [1 ,2 ,3 ]
Kong, Xiang-Xing [1 ,2 ,3 ]
Wang, Zhanhuai [1 ,2 ,3 ]
Li, Jun [1 ,2 ,3 ]
Jiang, Wei-Cheng [4 ]
Hwang, Kao-Shing [5 ]
Ding, Kefeng [1 ,2 ,3 ]
机构
[1] Zhejiang Univ, Dept Colorectal Surg, Affiliated Hosp 2, Sch Med, Hangzhou, Peoples R China
[2] China Natl Minist Educ, Key Lab Mol Biol Med Sci, Key Lab Canc Prevent & Intervent, Canc Inst, Hangzhou, Zhejiang, Peoples R China
[3] Zhejiang Univ, Sch Med, Affiliated Hosp 2, Hangzhou, Peoples R China
[4] Tunghai Univ, Dept Elect Engn, Taichung, Taiwan
[5] Natl Sun Yat Sen Univ, Dept Elect Engn, Kaohsiung, Taiwan
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Colorectal cancer; Computer-aided detection; Deep learning; Polyp detection; Convolutional neural network; POLYPS;
D O I
10.1016/j.compmedimag.2020.101763
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Conventional computer-aided detection systems (CADs) for colonoscopic images utilize shape, texture, or temporal information to detect polyps, so they have limited sensitivity and specificity. This study proposes a method to extract possible polyp features automatically using convolutional neural networks (CNNs). The objective of this work aims at building up a light-weight dual encoder-decoder model structure for polyp detection in colonoscopy Images. This proposed model, though with a relatively shallow structure, is expected to have the capability of a similar performance to the methods with much deeper structures. The proposed CAD model consists of two sequential encoder-decoder networks that consist of several CNN layers and full connection layers. The front end of the model is a hetero-associator (also known as hetero-encoder) that uses backpropagation learning to generate a set of reliably corrupted labeled images with a certain degree of similarity to a ground truth image, which eliminates the need for a large amount of training data that is usually required for medical images tasks. This dual CNN architecture generates a set of noisy images that are similar to the labeled data to train its counterpart, the auto-associator (also known as auto-encoder), in order to increase the successor's discriminative power in classification. The auto-encoder is also equipped with CNNs to simultaneously capture the features of the labeled images that contain noise. The proposed method uses features that are learned from open medical datasets and the dataset of Zhejiang University (ZJU), which contains around one thousand images. The performance of the proposed architecture is compared with a state-of-the-art detection model in terms of the metrics of the Jaccard index, the DICE similarity score, and two other geometric measures. The improvements in the performance of the proposed model are attributed to the effective reduction in false positives in the auto-encoder and the generation of noisy candidate images by the hetero-encoder. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:9
相关论文
共 50 条
[21]   An Encoder-Decoder with a Residual Network for Fusing Hyperspectral and Panchromatic Remote Sensing Images [J].
Zhao, Rui ;
Du, Shihong .
REMOTE SENSING, 2022, 14 (09)
[22]   REDN: A Recursive Encoder-Decoder Network for Edge Detection [J].
Le, Truc ;
Duan, Ye .
IEEE ACCESS, 2020, 8 :90153-90164
[23]   Plant Disease Identification Based on Encoder-Decoder Model [J].
Feng, Wenfeng ;
Sun, Guoying ;
Zhang, Xin .
AGRONOMY-BASEL, 2024, 14 (10)
[24]   Automated segmentation of textured dust storms on mars remote sensing images using an encoder-decoder type convolutional neural network [J].
Ogohara, Kazunori ;
Gichu, Ryusei .
COMPUTERS & GEOSCIENCES, 2022, 160
[25]   A dual-stream encoder-decoder network with attention mechanism for saliency detection in video(s) [J].
Kumain, Sandeep Chand ;
Singh, Maheep ;
Awasthi, Lalit Kumar .
SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (03) :2037-2046
[26]   Using LSTM encoder-decoder for rhetorical structure prediction [J].
de Moura, Gustavo Bennemann ;
Feltrim, Valeria Delisandra .
2018 7TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2018, :278-283
[27]   Alpha matting for portraits using encoder-decoder models [J].
Akshat Srivastava ;
Srivatsav Raghu ;
Abitha K Thyagarajan ;
Jayasri Vaidyaraman ;
Mohanaprasad Kothandaraman ;
Pavan Sudheendra ;
Avinav Goel .
Multimedia Tools and Applications, 2022, 81 :14517-14528
[28]   Image Segmentation Using Encoder-Decoder with Deformable Convolutions [J].
Gurita, Andreea ;
Mocanu, Irina Georgiana .
SENSORS, 2021, 21 (05) :1-27
[29]   Arabic Machine Transliteration using an Attention-based Encoder-decoder Model [J].
Ameur, Mohamed Seghir Hadj ;
Meziane, Farid ;
Guessoum, Ahmed .
ARABIC COMPUTATIONAL LINGUISTICS (ACLING 2017), 2017, 117 :287-297
[30]   Alpha matting for portraits using encoder-decoder models [J].
Srivastava, Akshat ;
Raghu, Srivatsav ;
Thyagarajan, Abitha K. ;
Vaidyaraman, Jayasri ;
Kothandaraman, Mohanaprasad ;
Sudheendra, Pavan ;
Goel, Avinav .
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (10) :14517-14528