Semi-Supervised Learning With Deep Embedded Clustering for Image Classification and Segmentation

被引:68
作者
Enguehard, Joseph [1 ,2 ,3 ]
O'Halloran, Peter [4 ,5 ]
Gholipour, Ali [1 ,2 ]
机构
[1] Boston Childrens Hosp, Dept Radiol, Computat Radiol Lab, Boston, MA 02115 USA
[2] Harvard Med Sch, Boston, MA 02115 USA
[3] Telecom ParisTech, F-75013 Paris, France
[4] Mt Auburn Hosp, Dept Radiol, Cambridge, MA 02138 USA
[5] Harvard Univ, Cambridge, MA 02138 USA
关键词
Deep learning; semi-supervised learning; deep embedded clustering; image segmentation; CONVOLUTIONAL NEURAL-NETWORKS; FRAMEWORK; MRI;
D O I
10.1109/ACCESS.2019.2891970
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep neural networks usually require large labeled datasets to construct accurate models; however, in many real-world scenarios, such as medical image segmentation, labeling data are a time-consuming and costly human (expert) intelligent task. Semi-supervised methods leverage this issue by making use of a small labeled dataset and a larger set of unlabeled data. In this paper, we present a flexible framework for semi-supervised learning that combines the power of supervised methods that learn feature representations using state-of-the-art deep convolutional neural networks with the deeply embedded clustering algorithm that assigns data points to clusters based on their probability distributions and feature representations learned by the networks. Our proposed semi-supervised learning algorithm based on deeply embedded clustering (SSLDEC) learns feature representations via iterations by alternatively using labeled and unlabeled data points and computing target distributions from predictions. During this iterative procedure, the algorithm uses labeled samples to keep the model consistent and tuned with labeling, as it simultaneously learns to improve feature representation and predictions. The SSLDEC requires a few hyper-parameters and thus does not need large labeled validation sets, which addresses one of the main limitations of many semi-supervised learning algorithms. It is also flexible and can be used with many stateof-the-art deep neural network configurations for image classification and segmentation tasks. To this end, we implemented and tested our approach on benchmark image classification tasks as well as in a challenging medical image segmentation scenario. In benchmark classification tasks, the SSLDEC outperformed several state-of-the-art semi-supervised learning methods, achieving 0.46% error on MNIST with 1000 labeled points and 4.43% error on SVHN with 500 labeled points. In the iso-intense infant brain MRI tissue segmentation task, we implemented SSLDEC on a 3D densely connected fully convolutional neural network where we achieved significant improvement over supervised-only training as well as a semi-supervised method based on pseudo-labeling. Our results show that the SSLDEC can be effectively used to reduce the need for costly expert annotations, enhancing applications, such as automatic medical image segmentation.
引用
收藏
页码:11093 / 11104
页数:12
相关论文
共 48 条
[1]  
An Le, 2016, Med Image Comput Comput Assist Interv, V9901, P79, DOI 10.1007/978-3-319-46723-8_10
[2]  
[Anonymous], 6 MONTHS INFANT BRAI
[3]  
[Anonymous], 2015, Nature, DOI [10.1038/nature14539, DOI 10.1038/NATURE14539]
[4]  
[Anonymous], 2016, Deep learning. vol
[5]  
[Anonymous], 2015, INT C LEARNING REPRE
[6]  
[Anonymous], P IEEE C COMP VIS PA
[7]  
[Anonymous], 2015, PREPRINT
[8]  
Buitinck Lars, 2013, ECML PKDD WORKSH LAN, P108, DOI DOI 10.48550/ARXIV.1309.0238
[9]  
Cheplygina V., 2018, Not-so-supervised: a survey of semi-supervised, multi-instance, and transfer learning in medical image analysis
[10]  
Cheung V., 2016, P NIPS, P2234