Semi-supervised Learning with a Teacher-Student Network for Generalized Attribute Prediction

Cited by: 7
Authors
Shin, Minchul [1]
Affiliations
[1] Search Solut Inc, Seoul, Gyeonggi Do, South Korea
Source
COMPUTER VISION - ECCV 2020, PT XI | 2020 / Vol. 12356
Keywords
Semi-supervised learning; Unlabeled data; Visual attributes;
DOI
10.1007/978-3-030-58621-8_30
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents a study on semi-supervised learning to solve the visual attribute prediction problem. In many applications of vision algorithms, the precise recognition of visual attributes of objects is important but still challenging. This is because defining a class hierarchy of attributes is ambiguous, so training data inevitably suffer from class imbalance and label sparsity, leading to a lack of effective annotations. An intuitive solution is to find a method to effectively learn image representations by utilizing unlabeled images. With that in mind, we propose a multi-teacher-single-student (MTSS) approach inspired by the multi-task learning and the distillation of semi-supervised learning. Our MTSS learns task-specific domain experts called teacher networks using the label embedding technique and learns a unified model called a student network by forcing a model to mimic the distributions learned by domain experts. Our experiments demonstrate that our method not only achieves competitive performance on various benchmarks for fashion attribute prediction, but also improves robustness and cross-domain adaptability for unseen domains.
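The abstract says the student network is trained to mimic the distributions learned by the task-specific teachers, but it does not state the loss used. As a minimal sketch only, assuming each teacher and the student emit per-task logits and that "mimicking" is measured by KL divergence between softmax outputs (a common distillation choice, not confirmed by this record), the idea might look like the following. All function names, the `temperature` parameter, and the example logits are hypothetical.

```python
import math


def softmax(logits, temperature=1.0):
    """Convert raw scores into a probability distribution."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def distillation_loss(teacher_logits_per_task, student_logits_per_task,
                      temperature=1.0):
    """Average KL(teacher || student) over tasks, one teacher per task.

    Assumption: this stands in for the unspecified mimicry objective.
    """
    losses = []
    for t_logits, s_logits in zip(teacher_logits_per_task,
                                  student_logits_per_task):
        p_teacher = softmax(t_logits, temperature)
        p_student = softmax(s_logits, temperature)
        losses.append(kl_divergence(p_teacher, p_student))
    return sum(losses) / len(losses)


# Hypothetical logits for two attribute tasks (for example colour and pattern).
teachers = [[2.0, 0.5, -1.0], [0.1, 1.2]]
student_far = [[-1.0, 0.5, 2.0], [1.2, 0.1]]
student_close = [[1.9, 0.6, -0.9], [0.2, 1.1]]

loss_far = distillation_loss(teachers, student_far)
loss_close = distillation_loss(teachers, student_close)
```

In this sketch a student whose outputs sit closer to the teachers' receives a smaller loss, and no labels are needed to compute it, which is what would allow unlabeled images to contribute to training.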
Pages: 509-525
Page count: 17