Multi-Task Deep Learning Games: Investigating Nash Equilibria and Convergence Properties

被引：3

作者：

Lee, Minhyeok ^{[1
]}

机构：

[1] Chung Ang Univ, Sch Elect & Elect Engn, Seoul 06974, South Korea

来源：

AXIOMS | 2023年 / 12卷 / 06期

关键词：

multi-task deep learning; multi-task; multiple losses; deep learning; game theory; Nash equilibrium; learning agents;

D O I：

10.3390/axioms12060569

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

This paper conducts a rigorous game-theoretic analysis on multi-task deep learning, providing mathematical insights into the dynamics and interactions of tasks within these models. Multi-task deep learning has attracted significant attention in recent years due to its ability to leverage shared representations across multiple correlated tasks, leading to improved generalization and reduced training time. However, understanding and examining the interactions between tasks within a multi-task deep learning system poses a considerable challenge. In this paper, we present a game-theoretic investigation of multi-task deep learning, focusing on the existence and convergence of Nash equilibria. Game theory provides a suitable framework for modeling the interactions among various tasks in a multi-task deep learning system, as it captures the strategic behavior of learning agents sharing a common set of parameters. Our primary contributions include: casting the multi-task deep learning problem as a game where each task acts as a player aiming to minimize its task-specific loss function; introducing the notion of a Nash equilibrium for the multi-task deep learning game; demonstrating the existence of at least one Nash equilibrium under specific convexity and Lipschitz continuity assumptions for the loss functions; examining the convergence characteristics of the Nash equilibrium; and providing a comprehensive analysis of the implications and limitations of our theoretical findings. We also discuss potential extensions and directions for future research in the multi-task deep learning landscape.

引用

页数：22

共 43 条

[1] An Optimized Learning Model Augment Analyst Decisions for Seismic Source Discrimination
Abdalzaher, Mohamed S.
Moustafa, Sayed S. R.
Hafiez, H. E. Abdel
Ahmed, Walid Farid
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[2] A Deep Learning Model for Earthquake Parameters Observation in IoT System-Based Earthquake Early Warning
Abdalzaher, Mohamed S.
Soliman, M. Sami
El-Hady, Sherif M.
Benslimane, Abderrahim
Elwekeil, Mohamed
[J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (11): : 8412 - 8424
[3] A Game-Theoretic Approach for Enhancing Security and Data Trustworthiness in IoT Applications
Abdalzaher, Mohamed S.
Muta, Osamu
[J]. IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (11): : 11250 - 11261
[4] Aggarwal A., 2021, International Journal of Information Management Data Insights, V1, DOI DOI 10.1016/J.JJIMEI.2020.100004
[5] Hardware Approximate Techniques for Deep Neural Network Accelerators: A Survey
Armeniakos, Giorgos
Zervakis, Georgios
Soudris, Dimitrios
Henkel, Joerg
[J]. ACM COMPUTING SURVEYS, 2023, 55 (04)
[6] Arpit D., 2019, Advances in Neural Information Processing Systems, V32, P10902
[7] Mip-NeRF 360: Unbounded Anti-Aliased Neural Radiance Fields
Barron, Jonathan T.
Mildenhall, Ben
Verbin, Dor
Srinivasan, Pratul P.
Hedman, Peter
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5460 - 5469
[8] Blattmann A., 2022, Advances in Neural Information Processing Systems, V35, P15309
[9] Machine learning and the physical sciences
Carleo, Giuseppe
Cirac, Ignacio
Cranmer, Kyle
Daudet, Laurent
Schuld, Maria
Tishby, Naftali
Vogt-Maranto, Leslie
Zdeborova, Lenka
[J]. REVIEWS OF MODERN PHYSICS, 2019, 91 (04)
[10] A survey on deep learning applied to medical images: from simple artificial neural networks to generative models
Celard, P.
Iglesias, E. L.
Sorribes-Fdez, J. M.
Romero, R.
Vieira, A. Seara
Borrajo, L.
[J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (03) : 2291 - 2323

← 1 2 3 4 5 →