Learning a Deep ConvNet for Multi-label Classification with Partial Labels

Cited by: 144
Authors
Durand, Thibaut [1]
Mehrasa, Nazanin [2]
Mori, Greg [2]
Affiliations
[1] Borealis AI, Toronto, ON, Canada
[2] Simon Fraser Univ, Burnaby, BC, Canada
Source
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019
Keywords
DOI
10.1109/CVPR.2019.00074
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104; 0812; 0835; 1405;
Abstract
Deep ConvNets have shown great performance for single-label image classification (e.g., ImageNet), but it is necessary to move beyond the single-label classification task because pictures of everyday life are inherently multi-label. Multi-label classification is a more difficult task than single-label classification because both the input images and output label spaces are more complex. Furthermore, collecting clean multi-label annotations is more difficult to scale up than single-label annotations. To reduce the annotation cost, we propose to train a model with partial labels, i.e., only some labels are known per image. We first empirically compare different labeling strategies to show the potential for using partial labels on multi-label datasets. Then, to learn with partial labels, we introduce a new classification loss that exploits the proportion of known labels per example. Our approach allows the use of the same training settings as when learning with all the annotations. We further explore several curriculum-learning-based strategies to predict missing labels. Experiments are performed on three large-scale multi-label datasets: MS COCO, NUS-WIDE, and Open Images.
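The abstract does not give the form of the proposed loss. As a rough illustration of the general idea only, the sketch below computes a binary cross-entropy over the known labels of each example and normalizes by the number of known labels (equivalently, by the number of categories times the proportion of known labels). The function name `partial_bce`, the label encoding (+1 positive, -1 negative, 0 unknown), and the normalization are all assumptions made for this example; this is not the authors' exact loss.

```python
import numpy as np

def partial_bce(logits, labels):
    """Hypothetical masked binary cross-entropy for partial labels.

    logits: array of shape (N, C) with raw classifier scores.
    labels: array of shape (N, C) with +1 (positive), -1 (negative),
            or 0 (unknown). This encoding is an assumption of the sketch.
    Returns one loss value per example, shape (N,).
    """
    logits = np.asarray(logits, dtype=float)
    labels = np.asarray(labels, dtype=float)

    # Only labels that are actually annotated contribute to the loss.
    known = labels != 0

    # log(1 + exp(-y * x)), computed stably with logaddexp.
    per_label = np.logaddexp(0.0, -labels * logits)
    per_label = np.where(known, per_label, 0.0)

    # Normalize by the number of known labels, i.e. C * (proportion known),
    # so examples with few annotations are not down-weighted.
    n_known = known.sum(axis=1)
    return per_label.sum(axis=1) / np.maximum(n_known, 1)
```

With this normalization, an example with 2 known labels out of 80 contributes on the same scale as a fully annotated one; dividing by the total number of categories instead would shrink its loss in proportion to the missing labels.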
Pages: 647-657
Number of pages: 11