Cross-stitch Networks for Multi-task Learning

被引:818
|
作者
Misra, Ishan [1 ]
Shrivastava, Abhinav [1 ]
Gupta, Abhinav [1 ]
Hebert, Martial [1 ]
机构
[1] Carnegie Mellon Univ, Inst Robot, Pittsburgh, PA 15213 USA
关键词
D O I
10.1109/CVPR.2016.433
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-task learning in Convolutional Networks has displayed remarkable success in the field of recognition. This success can be largely attributed to learning shared representations from multiple supervisory tasks. However, existing multi-task approaches rely on enumerating multiple network architectures specific to the tasks at hand, that do not generalize. In this paper, we propose a principled approach to learn shared representations in ConvNets using multi-task learning. Specifically, we propose a new sharing unit: "cross-stitch" unit. These units combine the activations from multiple networks and can be trained end-to-end. A network with cross-stitch units can learn an optimal combination of shared and task-specific representations. Our proposed method generalizes across multiple tasks and shows dramatically improved performance over baseline methods for categories with few training examples.
引用
收藏
页码:3994 / 4003
页数:10
相关论文
共 50 条
  • [41] Multi-Task Learning for Cross-Lingual Abstractive Summarization
    Takase, Sho
    Okazaki, Naoaki
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 3008 - 3016
  • [42] Multi-task Learning Neural Networks for Comparative Elements Extraction
    Liu, Dianqing
    Wang, Lihui
    Shao, Yanqiu
    CHINESE LEXICAL SEMANTICS (CLSW 2020), 2021, 12278 : 398 - 407
  • [43] Multi-Task Learning with Sequence-Conditioned Transporter Networks
    Lim, Michael H.
    Zeng, Andy
    Ichter, Brian
    Bandari, Maryam
    Coumans, Erwin
    Tomlin, Claire
    Schaal, Stefan
    Faust, Aleksandra
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 2489 - 2496
  • [44] Multi-Task Learning Based on Stochastic Configuration Neural Networks
    Dong, Xue-Mei
    Kong, Xudong
    Zhang, Xiaoping
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2022, 10
  • [45] Bayesian online multi-task learning using regularization networks
    Pillonetto, Gianluigi
    Dinuzzo, Francesco
    De Nicolao, Giuseppe
    2008 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2008, : 4517 - +
  • [46] Deep Elastic Networks with Model Selection for Multi-Task Learning
    Ahn, Chanho
    Kim, Eunwoo
    Oh, Songhwai
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6528 - 6537
  • [47] Multi-task Contrastive Learning for Anomaly Detection on Attributed Networks
    Zhang, Junjie
    Ding, Yuxin
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT I, PAKDD 2024, 2024, 14645 : 15 - 26
  • [48] Multi-Task Reinforcement Meta-Learning in Neural Networks
    Shakah, Ghazi
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (07) : 263 - 269
  • [49] Evolving Deep Parallel Neural Networks for Multi-Task Learning
    Wu, Jie
    Sun, Yanan
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2021, PT II, 2022, 13156 : 517 - 531
  • [50] Learning to Branch for Multi-Task Learning
    Guo, Pengsheng
    Lee, Chen-Yu
    Ulbricht, Daniel
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119