Cross-stitch Networks for Multi-task Learning

被引:827
作者
Misra, Ishan [1 ]
Shrivastava, Abhinav [1 ]
Gupta, Abhinav [1 ]
Hebert, Martial [1 ]
机构
[1] Carnegie Mellon Univ, Inst Robot, Pittsburgh, PA 15213 USA
来源
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2016年
关键词
D O I
10.1109/CVPR.2016.433
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-task learning in Convolutional Networks has displayed remarkable success in the field of recognition. This success can be largely attributed to learning shared representations from multiple supervisory tasks. However, existing multi-task approaches rely on enumerating multiple network architectures specific to the tasks at hand, that do not generalize. In this paper, we propose a principled approach to learn shared representations in ConvNets using multi-task learning. Specifically, we propose a new sharing unit: "cross-stitch" unit. These units combine the activations from multiple networks and can be trained end-to-end. A network with cross-stitch units can learn an optimal combination of shared and task-specific representations. Our proposed method generalizes across multiple tasks and shows dramatically improved performance over baseline methods for categories with few training examples.
引用
收藏
页码:3994 / 4003
页数:10
相关论文
共 62 条
  • [1] [Anonymous], JMLR
  • [2] [Anonymous], IJCV
  • [3] [Anonymous], 2015, ICCV
  • [4] [Anonymous], 2010, NIPS
  • [5] [Anonymous], STAT COMPUTING
  • [6] [Anonymous], JMLR
  • [7] [Anonymous], 2007, Advances in Neural Information Processing Systems, DOI DOI 10.7551/MITPRESS/7503.003.0010
  • [8] [Anonymous], 2007, P 24 ANN INT C MACH
  • [9] [Anonymous], 2009, CVPR
  • [10] [Anonymous], 2013, 31 INT C MACH LEARN