DeepCut: Unsupervised Segmentation using Graph Neural Networks Clustering

被引:20
作者
Aflalo, Amit [1 ]
Bagon, Shai [1 ]
Kashti, Tamar [1 ]
Eldar, Yonina [1 ]
机构
[1] Weizmann Inst Sci, Fac Math & Comp Sci, Rehovot, Israel
来源
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW | 2023年
关键词
D O I
10.1109/ICCVW60793.2023.00010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image segmentation is a fundamental task in computer vision. Data annotation for training supervised methods can be labor-intensive, motivating unsupervised methods. Current approaches often rely on extracting deep features from pre-trained networks to construct a graph, and classical clustering methods like k-means and normalized-cuts are then applied as a post-processing step. However, this approach reduces the high-dimensional information encoded in the features to pair-wise scalar affinities. To address this limitation, this study introduces a lightweight Graph Neural Network (GNN) to replace classical clustering methods while optimizing for the same clustering objective function. Unlike existing methods, our GNN takes both the pair-wise affinities between local image features and the raw features as input. This direct connection between the raw features and the clustering objective enables us to implicitly perform classification of the clusters between different graphs, resulting in part semantic segmentation without the need for additional post-processing steps. We demonstrate how classical clustering objectives can be formulated as self-supervised loss functions for training an image segmentation GNN. Furthermore, we employ the Correlation-Clustering (CC) objective to perform clustering without defining the number of clusters, allowing for k-less clustering. We apply the proposed method for object localization, segmentation, and semantic part segmentation tasks, surpassing state-of-the-art performance on multiple benchmarks(1).
引用
收藏
页码:32 / 41
页数:10
相关论文
共 37 条
[1]  
Amir S, 2022, Arxiv, DOI arXiv:2112.05814
[2]   Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples [J].
Assran, Mahmoud ;
Caron, Mathilde ;
Misra, Ishan ;
Bojanowski, Piotr ;
Joulin, Armand ;
Ballas, Nicolas ;
Rabbat, Michael .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :8423-8432
[3]  
Bagon S, 2011, Arxiv, DOI arXiv:1112.2903
[4]   Correlation clustering [J].
Bansal, N ;
Blum, A ;
Chawla, S .
MACHINE LEARNING, 2004, 56 (1-3) :89-113
[5]   The Fast Bilateral Solver [J].
Barron, Jonathan T. ;
Poole, Ben .
COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 :617-632
[6]   OneGAN: Simultaneous Unsupervised Learning of Conditional Image Generation, Foreground Segmentation, and Fine-Grained Clustering [J].
Benny, Yaniv ;
Wolf, Lior .
COMPUTER VISION - ECCV 2020, PT XXVI, 2020, 12371 :514-530
[7]  
Bianchi FM, 2020, PR MACH LEARN RES, V119
[8]   Emerging Properties in Self-Supervised Vision Transformers [J].
Caron, Mathilde ;
Touvron, Hugo ;
Misra, Ishan ;
Jegou, Herve ;
Mairal, Julien ;
Bojanowski, Piotr ;
Joulin, Armand .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9630-9640
[9]   An Empirical Study of Training Self-Supervised Vision Transformers [J].
Chen, Xinlei ;
Xie, Saining ;
He, Kaiming .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9620-9629
[10]  
Choudhury Subhabrata, 2021, Advances in Neural Information Processing Systems, V34, P6