Cross-stitch Networks for Multi-task Learning

被引：818

作者：

Misra, Ishan ^{[1
]}

Shrivastava, Abhinav ^{[1
]}

Gupta, Abhinav ^{[1
]}

Hebert, Martial ^{[1
]}

机构：

[1] Carnegie Mellon Univ, Inst Robot, Pittsburgh, PA 15213 USA

来源：

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2016年

关键词：

D O I：

10.1109/CVPR.2016.433

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multi-task learning in Convolutional Networks has displayed remarkable success in the field of recognition. This success can be largely attributed to learning shared representations from multiple supervisory tasks. However, existing multi-task approaches rely on enumerating multiple network architectures specific to the tasks at hand, that do not generalize. In this paper, we propose a principled approach to learn shared representations in ConvNets using multi-task learning. Specifically, we propose a new sharing unit: "cross-stitch" unit. These units combine the activations from multiple networks and can be trained end-to-end. A network with cross-stitch units can learn an optimal combination of shared and task-specific representations. Our proposed method generalizes across multiple tasks and shows dramatically improved performance over baseline methods for categories with few training examples.

引用

页码：3994 / 4003

页数：10

共 50 条

[41] Multi-Task Learning for Cross-Lingual Abstractive Summarization
Takase, Sho
Okazaki, Naoaki
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 3008 - 3016
[42] Multi-task Learning Neural Networks for Comparative Elements Extraction
Liu, Dianqing
Wang, Lihui
Shao, Yanqiu
CHINESE LEXICAL SEMANTICS (CLSW 2020), 2021, 12278 : 398 - 407
[43] Multi-Task Learning with Sequence-Conditioned Transporter Networks
Lim, Michael H.
Zeng, Andy
Ichter, Brian
Bandari, Maryam
Coumans, Erwin
Tomlin, Claire
Schaal, Stefan
Faust, Aleksandra
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 2489 - 2496
[44] Multi-Task Learning Based on Stochastic Configuration Neural Networks
Dong, Xue-Mei
Kong, Xudong
Zhang, Xiaoping
FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2022, 10
[45] Bayesian online multi-task learning using regularization networks
Pillonetto, Gianluigi
Dinuzzo, Francesco
De Nicolao, Giuseppe
2008 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2008, : 4517 - +
[46] Deep Elastic Networks with Model Selection for Multi-Task Learning
Ahn, Chanho
Kim, Eunwoo
Oh, Songhwai
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6528 - 6537
[47] Multi-task Contrastive Learning for Anomaly Detection on Attributed Networks
Zhang, Junjie
Ding, Yuxin
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT I, PAKDD 2024, 2024, 14645 : 15 - 26
[48] Multi-Task Reinforcement Meta-Learning in Neural Networks
Shakah, Ghazi
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (07) : 263 - 269
[49] Evolving Deep Parallel Neural Networks for Multi-Task Learning
Wu, Jie
Sun, Yanan
ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2021, PT II, 2022, 13156 : 517 - 531
[50] Learning to Branch for Multi-Task Learning
Guo, Pengsheng
Lee, Chen-Yu
Ulbricht, Daniel
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119

← 1 2 3 4 5 →