Learning a Deep ConvNet for Multi-label Classification with Partial Labels

被引:144
作者
Durand, Thibaut [1 ]
Mehrasa, Nazanin [2 ]
Mori, Greg [2 ]
机构
[1] Borealis AI, Toronto, ON, Canada
[2] Simon Fraser Univ, Burnaby, BC, Canada
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
关键词
D O I
10.1109/CVPR.2019.00074
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep ConvNets have shown great performance for single-label image classification (e.g. ImageNet), but it is necessary to move beyond the single-label classification task because pictures of everyday life are inherently multilabel. Multi-label classification is a more difficult task than single-label classification because both the input images and output label spaces are more complex. Furthermore, collecting clean multi-label annotations is more difficult to scale-up than single-label annotations. To reduce the annotation cost, we propose to train a model with partial labels i.e. only some labels are known per image. We first empirically compare different labeling strategies to show the potential for using partial labels on multi-label datasets. Then to learn with partial labels, we introduce a new classification loss that exploits the proportion of known labels per example. Our approach allows the use of the same training settings as when learning with all the annotations. We further explore several curriculum learning based strategies to predict missing labels. Experiments are performed on three large-scale multi-label datasets: MS COCO, NUS-WIDE and Open Images.
引用
收藏
页码:647 / 657
页数:11
相关论文
共 64 条
[1]  
Alina KuznetsovaHassan Rom., 2018, The open images dataset v4: Unified image classification, object detection, and visual relationship detection at scale
[2]  
[Anonymous], 2015, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2015.7298668
[3]  
[Anonymous], 2018, DO BETTER IMAGENET M
[4]  
Baeza-Yates R, 1999, Modern Information Retrieval
[5]  
Bengio Y., 2009, P 26 ANN INT C MACH, P41, DOI [DOI 10.1145/1553374.1553380.EVENT-PLACE, 10.1145/1553374.1553380, DOI 10.1145/1553374.15533802,5]
[6]  
Bucak S. S., 2011, 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), P2801, DOI 10.1109/CVPR.2011.5995734
[7]  
Cabral R. S., 2011, Advances in neural information processing systems, V201, P2
[8]  
Carlson Andrew, 2010, C ART INT AAAI
[9]  
Chapelle Olivier, 2010, Semi-Supervised Learning
[10]   Predicting Multiple Attributes via Relative Multi-task Learning [J].
Chen, Lin ;
Zhang, Qiang ;
Li, Baoxin .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :1027-1034