Unsupervised Learning of Foreground Object Segmentation

Cited by: 32

Authors:

Croitoru, Ioana [1]

Bogolin, Simion-Vlad [1]

Leordeanu, Marius [1, 2]

Affiliations:

[1] Romanian Acad, Inst Math, 21 Calea Grivitei, Bucharest, Romania

[2] Univ Politehn Bucuresti, 313 Splaiul Independentei, Bucharest, Romania

Source:

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2019 / Vol. 127 / Issue 09

Keywords:

Unsupervised learning; Foreground object segmentation; Object discovery in video; Transfer learning;

DOI:

10.1007/s11263-019-01183-3

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Unsupervised learning represents one of the most interesting challenges in computer vision today. The task has an immense practical value with many applications in artificial intelligence and emerging technologies, as large quantities of unlabeled images and videos can be collected at low cost. In this paper, we address the unsupervised learning problem in the context of segmenting the main foreground objects in single images. We propose an unsupervised learning system, which has two pathways, the teacher and the student, respectively. The system is designed to learn over several generations of teachers and students. At every generation the teacher performs unsupervised object discovery in videos or collections of images and an automatic selection module picks up good frame segmentations and passes them to the student pathway for training. At every generation multiple students are trained, with different deep network architectures to ensure a better diversity. The students at one iteration help in training a better selection module, forming together a more powerful teacher pathway at the next iteration. In experiments, we show that the improvement in the selection power, the training of multiple students and the increase in unlabeled data significantly improve segmentation accuracy from one generation to the next. Our method achieves top results on three current datasets for object discovery in video, unsupervised image segmentation and saliency detection. At test time, the proposed system is fast, being one to two orders of magnitude faster than published unsupervised methods. We also test the strength of our unsupervised features within a well known transfer learning setup and achieve competitive performance, proving that our unsupervised approach can be reliably used in a variety of computer vision tasks.
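The generational teacher-student loop described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the "networks" are simple intensity thresholds, the selection module is replaced by an ensemble-agreement score, and every name here (`make_threshold_segmenter`, `teacher_segment`, `select_best`, `train_student`, `run_generations`, `keep_ratio`, `student_grids`) is hypothetical and chosen only for illustration.

```python
from typing import Callable, List, Sequence, Tuple

Frame = List[float]   # toy "image": a flat list of pixel intensities
Mask = List[int]      # toy "segmentation": 1 = foreground, 0 = background
Segmenter = Callable[[Frame], Mask]


def make_threshold_segmenter(threshold: float) -> Segmenter:
    """Toy stand-in for a segmentation network (assumption, not the paper's model)."""
    return lambda frame: [1 if pixel > threshold else 0 for pixel in frame]


def agreement(a: Mask, b: Mask) -> float:
    """Fraction of pixels on which two masks agree."""
    return sum(x == y for x, y in zip(a, b)) / len(a)


def teacher_segment(frame: Frame, members: Sequence[Segmenter]) -> Tuple[Mask, float]:
    """Majority-vote mask plus a score: mean agreement of members with the vote."""
    masks = [member(frame) for member in members]
    vote = [1 if 2 * sum(column) > len(masks) else 0 for column in zip(*masks)]
    score = sum(agreement(vote, mask) for mask in masks) / len(masks)
    return vote, score


def select_best(frames: Sequence[Frame], members: Sequence[Segmenter],
                keep_ratio: float) -> List[Tuple[Frame, Mask]]:
    """Toy selection step: keep the highest-scoring fraction of teacher masks."""
    scored = [(frame, *teacher_segment(frame, members)) for frame in frames]
    scored.sort(key=lambda item: item[2], reverse=True)
    keep = max(1, int(len(scored) * keep_ratio))
    return [(frame, mask) for frame, mask, _ in scored[:keep]]


def train_student(pairs: Sequence[Tuple[Frame, Mask]],
                  candidate_thresholds: Sequence[float]) -> Segmenter:
    """'Train' a student by picking the threshold that best fits the pseudo-labels."""
    def fit(threshold: float) -> float:
        student = make_threshold_segmenter(threshold)
        return sum(agreement(student(frame), mask) for frame, mask in pairs)
    return make_threshold_segmenter(max(candidate_thresholds, key=fit))


def run_generations(frames: Sequence[Frame], initial_teacher: Segmenter,
                    student_grids: Sequence[Sequence[float]],
                    generations: int = 2, keep_ratio: float = 0.5) -> List[Segmenter]:
    """Each generation: teacher labels, selection filters, several students train,
    and the students together form the next generation's teacher."""
    members: List[Segmenter] = [initial_teacher]
    for _ in range(generations):
        pseudo_labels = select_best(frames, members, keep_ratio)
        members = [train_student(pseudo_labels, grid) for grid in student_grids]
    return members
```

In this sketch, the different candidate grids in `student_grids` play the role of the different student architectures, and the ensemble of students stands in for the stronger teacher pathway of the next generation.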


Pages: 1279-1302

Page count: 24
