Histogram of Oriented Gradients meet deep learning: A novel multi-task deep network for 2D surgical image semantic segmentation

被引:35
作者
Bhattarai, Binod [1 ,4 ]
Subedi, Ronast [2 ]
Gaire, Rebati Raman [2 ]
Vazquez, Eduard [3 ]
Stoyanov, Danail [1 ]
机构
[1] UCL, London, England
[2] Nepal Appl Math & Informat Inst Res NAAMII, Lalitpur, Nepal
[3] Redev Technol, Leeds, England
[4] Univ Aberdeen, Aberdeen, Scotland
基金
英国工程与自然科学研究理事会; 欧盟地平线“2020”;
关键词
Semantic segmentation; Multi-task learning; Self-supervised learning; Histogram of Oriented Gradients; CONVOLUTIONAL NEURAL-NETWORKS; CLASSIFICATION; AGGREGATION;
D O I
10.1016/j.media.2023.102747
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present our novel deep multi-task learning method for medical image segmentation. Existing multi-task methods demand ground truth annotations for both the primary and auxiliary tasks. Contrary to it, we propose to generate the pseudo-labels of an auxiliary task in an unsupervised manner. To generate the pseudo-labels, we leverage Histogram of Oriented Gradients (HOGs), one of the most widely used and powerful hand-crafted features for detection. Together with the ground truth semantic segmentation masks for the primary task and pseudo-labels for the auxiliary task, we learn the parameters of the deep network to minimize the loss of both the primary task and the auxiliary task jointly. We employed our method on two powerful and widely used semantic segmentation networks: UNet and U2Net to train in a multi-task setup. To validate our hypothesis, we performed experiments on two different medical image segmentation data sets. From the extensive quantitative and qualitative results, we observe that our method consistently improves the performance compared to the counter-part method. Moreover, our method is the winner of FetReg Endovis Sub-challenge on Semantic Segmentation organised in conjunction with MICCAI 2021. Code and implementation details are available at:https://github.com/thetna/medical_image_segmentation.
引用
收藏
页数:11
相关论文
共 54 条
[1]  
Allan M., 2019, arXiv
[2]   Medical Image Analysis using Convolutional Neural Networks: A Review [J].
Anwar, Syed Muhammad ;
Majid, Muhammad ;
Qayyum, Adnan ;
Awais, Muhammad ;
Alnowami, Majdi ;
Khan, Muhammad Khurram .
JOURNAL OF MEDICAL SYSTEMS, 2018, 42 (11)
[3]   Multitask learning [J].
Caruana, R .
MACHINE LEARNING, 1997, 28 (01) :41-75
[4]  
Chakravarty A, 2018, Arxiv, DOI [arXiv:1808.01355, DOI 10.48550/ARXIV.1808.01355]
[5]  
Chaurasia A, 2017, 2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP)
[6]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[7]  
Colleoni E, 2020, INT C MED IM COMP CO, P700, DOI 10.1007/978-3-030-59716-0_67
[8]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[9]  
Dong J, 2014, LECT NOTES COMPUT SC, V8693, P299, DOI 10.1007/978-3-319-10602-1_20
[10]   Blood vessel segmentation methodologies in retinal images - A survey [J].
Fraz, M. M. ;
Remagnino, P. ;
Hoppe, A. ;
Uyyanonvara, B. ;
Rudnicka, A. R. ;
Owen, C. G. ;
Barman, S. A. .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2012, 108 (01) :407-433