Grammars for Games: A Gradient - Based, Game-Theoretic Framework for Optimization in Deep Learning

被引:2
|
作者
Balduzzi, David [1 ]
机构
[1] Victoria Univ Wellington, Sch Math & Stat, Wellington, New Zealand
来源
FRONTIERS IN ROBOTICS AND AI | 2016年 / 2卷
关键词
deep learning; representation learning; optimization; game theory; neural networks;
D O I
10.3389/frobt.2015.00039
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Deep learning is currently the subject of intensive study. However, fundamental concepts such as representations are not formally defined researchers "know them when they see them" and there is no common language for describing and analyzing algorithms. This essay proposes an abstract framework that identifies the essential features of current practice and may provide a foundation for future developments. The backbone of almost all deep learning algorithms is backpropagation, which is simply a gradient computation distributed over a neural network. The main ingredients of the framework are, thus, unsurprisingly: (i) game theory, to formalize distributed optimization; and (ii) communication protocols, to track the flow of zeroth and first-order information. The framework allows natural definitions of semantics (as the meaning encoded in functions), representations (as functions whose semantics is chosen to optimized a criterion), and grammars (as communication protocols equipped with first-order convergence guarantees). Much of the essay is spent discussing examples taken from the literature. The ultimate aim is to develop a graphical language for describing the structure of deep learning algorithms that backgrounds the details of the optimization procedure and foregrounds how the components interact. Inspiration is taken from probabilistic graphical models and factor graphs, which capture the essential structural features of multivariate distributions.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Games, Dollars, Splits: A Game-Theoretic Analysis of Split Manufacturing
    Gohil V.
    Tressler M.
    Sipple K.
    Patnaik S.
    Rajendran J.
    IEEE Transactions on Information Forensics and Security, 2021, 16 : 5077 - 5092
  • [22] A game-theoretic framework for the security system of visible watermarking
    Tsai, Min-Jen
    Liu, Jung
    Wang, Chen-Sheng
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (05) : 5748 - 5754
  • [23] Game-Theoretic Framework for Malicious Controller Detection in Software Defined Networks
    Sridharan, Vignesh
    Gurusamy, Mohan
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2021, 18 (03): : 3107 - 3120
  • [24] Integrated Model of Production Plan and Preventive Maintenance Based on a Game-Theoretic Framework
    Hu, J. W.
    Jiang, Z. H.
    2015 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM), 2015, : 1007 - 1011
  • [25] Generosity Pays Off: A Game-Theoretic Study of Cooperation in Decentralized Learning
    Di Giacomo, G.
    Malandrino, F.
    Chiasserini, C. F.
    2024 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS, ICC WORKSHOPS 2024, 2024, : 105 - 110
  • [26] Quantitative Analysis of Systems Using Game-Theoretic Learning
    Seshia, Sanjit A.
    Rakhlin, Alexander
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2012, 11
  • [27] A Game-Theoretic Framework for Optimum Decision Fusion in the Presence of Byzantines
    Abrardo, Andrea
    Barni, Mauro
    Kallas, Kassem
    Tondi, Benedetta
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2016, 11 (06) : 1333 - 1345
  • [28] A Game-Theoretic Framework for the Virtual Machines Migration Timing Problem
    Anwar, Ahmed H.
    Atia, George
    Guirguis, Mina
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2021, 9 (03) : 854 - 867
  • [29] An enriched game-theoretic framework for multi-objective clustering
    Badami, Mahsa
    Hamzeh, Ali
    Hashemi, Sattar
    APPLIED SOFT COMPUTING, 2013, 13 (04) : 1853 - 1868
  • [30] GAME-THEORETIC RATE-DISTORTION-COMPLEXITY OPTIMIZATION FOR HEVC
    Ukhanova, Anna
    Milani, Simone
    Forchhammer, Soren
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 1995 - 1999