Local Style Preservation in Improved GAN-Driven Synthetic Image Generation for Endoscopic Tool Segmentation

被引:13
作者
Su, Yun-Hsuan [1 ]
Jiang, Wenfan [1 ]
Chitrakar, Digesh [2 ]
Huang, Kevin [2 ]
Peng, Haonan [3 ]
Hannaford, Blake [3 ]
机构
[1] Mt Holyoke Coll, Dept Comp Sci, 50 Coll St, S Hadley, MA 01075 USA
[2] Trinity Coll, Dept Engn, 300 Summit St, Hartford, CT 06106 USA
[3] Univ Washington, Paul Allen Ctr, Dept Elect & Comp Engn, 185 Stevens Way, Seattle, WA 98105 USA
基金
美国国家科学基金会;
关键词
robot-assisted minimally invasive surgery; surgical tool segmentation; generative adversarial networks; UNet; medical imaging; SURGICAL INSTRUMENT SEGMENTATION; IMPROVED U-NET; NETWORKS; SURGERY; ARCHITECTURE; VISION;
D O I
10.3390/s21155163
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Accurate semantic image segmentation from medical imaging can enable intelligent vision-based assistance in robot-assisted minimally invasive surgery. The human body and surgical procedures are highly dynamic. While machine-vision presents a promising approach, sufficiently large training image sets for robust performance are either costly or unavailable. This work examines three novel generative adversarial network (GAN) methods of providing usable synthetic tool images using only surgical background images and a few real tool images. The best of these three novel approaches generates realistic tool textures while preserving local background content by incorporating both a style preservation and a content loss component into the proposed multi-level loss function. The approach is quantitatively evaluated, and results suggest that the synthetically generated training tool images enhance UNet tool segmentation performance. More specifically, with a random set of 100 cadaver and live endoscopic images from the University of Washington Sinus Dataset, the UNet trained with synthetically generated images using the presented method resulted in 35.7% and 30.6% improvement over using purely real images in mean Dice coefficient and Intersection over Union scores, respectively. This study is promising towards the use of more widely available and routine screening endoscopy to preoperatively generate synthetic training tool images for intraoperative UNet tool segmentation.
引用
收藏
页数:22
相关论文
共 106 条
[1]   Image Based Surgical Instrument Pose Estimation with Multi-class Labelling and Optical Flow [J].
Allan, Max ;
Chang, Ping-Lin ;
Ourselin, Sebastien ;
Hawkes, David J. ;
Sridhar, Ashwin ;
Kelly, John ;
Stoyanov, Danail .
MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2015, PT I, 2015, 9349 :331-338
[2]   Toward Detection and Localization of Instruments in Minimally Invasive Surgery [J].
Allan, Max ;
Ourselin, Sebastien ;
Thompson, Steve ;
Hawkes, David J. ;
Kelly, John ;
Stoyanov, Danail .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2013, 60 (04) :1050-1058
[3]   Surgical Tool Detection and Tracking in Retinal Microsurgery [J].
Alsheakhali, Mohamed ;
Yigitsoy, Mehmet ;
Eslami, Abouzar ;
Navab, Nassir .
MEDICAL IMAGING 2015: IMAGE-GUIDED PROCEDURES, ROBOTIC INTERVENTIONS, AND MODELING, 2015, 9415
[4]  
[Anonymous], 2018, LECT NOTES COMPUT SC
[5]   Fully automated 3D segmentation and separation of multiple cervical vertebrae in CT images using a 2D convolutional neural network [J].
Bae, Hyun-Jin ;
Hyun, Heejung ;
Byeon, Younghwa ;
Shin, Keewon ;
Cho, Yongwon ;
Song, Young Ji ;
Yi, Seong ;
Kuh, Sung-Uk ;
Yeom, Jin S. ;
Kim, Namkug .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2020, 184
[6]   AdaResU-Net: Multiobjective adaptive convolutional neural network for medical image segmentation [J].
Baldeon-Calisto, Maria ;
Lai-Yuen, Susana K. .
NEUROCOMPUTING, 2020, 392 :325-340
[7]   Navigation in endoscopic soft tissue surgery: Perspectives and limitations [J].
Baumhauer, Matthias ;
Feuerstein, Marco ;
Meinzer, Hans-Peter ;
Rassweiler, J. .
JOURNAL OF ENDOUROLOGY, 2008, 22 (04) :751-766
[8]  
Bloice M. D., ARXIV170804680
[9]   Detecting Surgical Tools by Modelling Local Appearance and Global Shape [J].
Bouget, David ;
Benenson, Rodrigo ;
Omran, Mohamed ;
Riffaud, Laurent ;
Schiele, Bernt ;
Jannin, Pierre .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2015, 34 (12) :2603-2617
[10]   Assessment of Deep Generative Models for High-Resolution Synthetic Retinal Image Generation of Age-Related Macular Degeneration [J].
Burlina, Philippe M. ;
Joshi, Neil ;
Pacheco, Katia D. ;
Liu, T. Y. Alvin ;
Bressler, Neil M. .
JAMA OPHTHALMOLOGY, 2019, 137 (03) :258-264