Fully-adaptive Feature Sharing in Multi-Task Networks with Applications in Person Attribute Classification

被引:229
作者
Lu, Yongxi [1 ]
Kumar, Abhishek [2 ]
Zhai, Shuangfei [3 ]
Cheng, Yu [2 ]
Javidi, Tara [1 ]
Feris, Rogerio [2 ]
机构
[1] Univ Calif San Diego, La Jolla, CA 92093 USA
[2] IBM Res, San Jose, CA USA
[3] SUNY Binghamton, Binghamton, NY 13902 USA
来源
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) | 2017年
关键词
SIMULTANEOUS SPARSE APPROXIMATION; ALGORITHMS;
D O I
10.1109/CVPR.2017.126
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-task learning aims to improve generalization performance of multiple prediction tasks by appropriately sharing relevant information across them. In the context of deep neural networks, this idea is often realized by hand-designed network architectures with layers that are shared across tasks and branches that encode task-specific features. However, the space of possible multi-task deep architectures is combinatorially large and often the final architecture is arrived at by manual exploration of this space, which can be both error-prone and tedious. We propose an automatic approach for designing compact multi-task deep learning architectures. Our approach starts with a thin multi-layer network and dynamically widens it in a greedy manner during training. By doing so iteratively, it creates a tree-like deep architecture, on which similar tasks reside in the same branch until at the top layers. Evaluation on person attributes classification tasks involving facial and clothing attributes suggests that the models produced by the proposed method are fast, compact and can closely match or exceed the state-of-the-art accuracy from strong baselines by much more expensive models.
引用
收藏
页码:1131 / 1140
页数:10
相关论文
共 50 条
[1]  
[Anonymous], 2015, NIPS
[2]  
[Anonymous], ACM MULTIMEDIA
[3]  
[Anonymous], ARXIV160902132
[4]  
[Anonymous], 2009, IEEE WINT C APPL COM
[5]  
[Anonymous], 2015, ICCV
[6]  
[Anonymous], 2016, CVPR
[7]  
[Anonymous], ICASSP
[8]  
[Anonymous], 2016, ECCV
[9]  
[Anonymous], IEEE T PATTERN ANAL
[10]  
[Anonymous], 2016, INT J COMPUTER VISIO