Contextual Diversity for Active Learning

被引：82

作者：

Agarwal, Sharat ^{[1
]}

Arora, Himanshu ^{[1
,2
]}

Anand, Saket ^{[1
]}

Arora, Chetan ^{[3
]}

机构：

[1] IIIT Delhi, New Delhi, India

[2] Flixstock Inc, New Delhi, India

[3] Indian Inst Technol Delhi, New Delhi, India

来源：

COMPUTER VISION - ECCV 2020, PT XVI | 2020年 / 12361卷

关键词：

D O I：

10.1007/978-3-030-58517-4_9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Requirement of large annotated datasets restrict the use of deep convolutional neural networks (CNNs) for many practical applications. The problem can be mitigated by using active learning (AL) techniques which, under a given annotation budget, allow to select a subset of data that yields maximum accuracy upon fine tuning. State of the art AL approaches typically rely on measures of visual diversity or prediction uncertainty, which are unable to effectively capture the variations in spatial context. On the other hand, modern CNN architectures make heavy use of spatial context for achieving highly accurate predictions. Since the context is difficult to evaluate in the absence of ground-truth labels, we introduce the notion of contextual diversity that captures the confusion associated with spatially co-occurring classes. Contextual Diversity (CD) hinges on a crucial observation that the probability vector predicted by a CNN for a region of interest typically contains information from a larger receptive field. Exploiting this observation, we use the proposed CD measure within two AL frameworks: (1) a core-set based strategy and (2) a reinforcement learning based policy, for active frame selection. Our extensive empirical evaluation establish state of the art results for active learning on benchmark datasets of Semantic Segmentation, Object Detection and Image classification. Our ablation studies show clear advantages of using contextual diversity for active learning. The source code and additional results are available at https://github.com/sharat29ag/CDAL.

引用

页码：137 / 153

页数：17

共 44 条

[1]

[Anonymous], 2018, Deep reinforcement learning for unsupervised video summarization with diversity-representativeness reward

[2]

Arazo E., 2019, Pseudolabeling and confirmation bias in deep semi-supervised learning

[3] The power of ensembles for active learning in image classification [J].

Beluch, William H. ;

Genewein, Tim ;

Nuernberger, Andreas ;

Koehler, Jan M. .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :9368-9377

[4]

Bilgic M., 2009, NIPS WORKSH AN NETW, V4

[5] Deep Clustering for Unsupervised Learning of Visual Features [J].

Caron, Mathilde ;

Bojanowski, Piotr ;

Joulin, Armand ;

Douze, Matthijs .

COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :139-156

[6]

Catlett J., 1994, MACHINE LEARN ING P, P148

[7] The Cityscapes Dataset for Semantic Urban Scene Understanding [J].

Cordts, Marius ;

Omran, Mohamed ;

Ramos, Sebastian ;

Rehfeld, Timo ;

Enzweiler, Markus ;

Benenson, Rodrigo ;

Franke, Uwe ;

Roth, Stefan ;

Schiele, Bernt .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223

[8]

Craven M, 2008, P 2008 C EMP METH NA, P1070

[9]

Dabak A.G., 1992, A geometry for detection theory

[10]

Ebert S, 2012, PROC CVPR IEEE, P3626, DOI 10.1109/CVPR.2012.6248108

← 1 2 3 4 5 →