Deep Elastic Networks with Model Selection for Multi-Task Learning

被引:35
作者
Ahn, Chanho [1 ,2 ]
Kim, Eunwoo [3 ]
Oh, Songhwai [1 ,2 ]
机构
[1] Seoul Natl Univ, Dept ECE, Seoul, South Korea
[2] Seoul Natl Univ, ASRI, Seoul, South Korea
[3] Univ Oxford, Dept Engn Sci, Oxford, England
来源
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年
关键词
D O I
10.1109/ICCV.2019.00663
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we consider the problem of instance-wise dynamic network model selection for multi-task learning. To this end, we propose an efficient approach to exploit a compact but accurate model in a backbone architecture for each instance of all tasks. The proposed method consists of an estimator and a selector. The estimator is based on a backbone architecture and structured hierarchically. It can produce multiple different network models of different configurations in a hierarchical structure. The selector chooses a model dynamically from a pool of candidate models given an input instance. The selector is a relatively small-size network consisting of a few layers, which estimates a probability distribution over the candidate models when an input instance of a task is given. Both estimator and selector are jointly trained in a unified learning framework in conjunction with a sampling-based learning strategy, without additional computation steps. We demonstrate the proposed approach for several image classification tasks compared to existing approaches performing model selection or learning multiple tasks. Experimental results show that our approach gives not only outstanding performance compared to other competitors but also the versatility to perform instance-wise model selection for multiple tasks.
引用
收藏
页码:6528 / 6537
页数:10
相关论文
共 39 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
Ahmed K., 2018, ECCV
[3]  
[Anonymous], 2018, CoRR
[4]  
[Anonymous], 1998, Exploration and inference in learning from reinforcement
[5]  
Ashok A., 2018, PROC INT C LEARN REP
[6]   Large-Scale Machine Learning with Stochastic Gradient Descent [J].
Bottou, Leon .
COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, :177-186
[7]   Multitask learning [J].
Caruana, R .
MACHINE LEARNING, 1997, 28 (01) :41-75
[8]   Fast Keyframe Selection and Switching for ICP-based Camera Pose Estimation [J].
Chen, Chun-Wei ;
Hsiao, Wen-Yuan ;
Lin, Ting-Yu ;
Wang, Jonas ;
Shieh, Ming-Der .
2018 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2018,
[9]  
Coates A., 2011, INT C ART INT STAT, P215, DOI DOI 10.1177/1753193410390845
[10]  
Donahue J, 2014, PR MACH LEARN RES, V32