RECENT DEVELOPMENTS IN MACHINE LEARNING METHODS FOR STOCHASTIC CONTROL AND GAMES

Authors
Hu, Ruimeng [1, 2]
Lauriere, Mathieu [3]
Affiliations
[1] Univ Calif Santa Barbara, Dept Math, Santa Barbara, CA 93106 USA
[2] Univ Calif Santa Barbara, Dept Stat & Appl Probabil, Santa Barbara, CA 93106 USA
[3] NYU Shanghai, Shanghai Frontiers Sci Ctr Artificial Intelligence, NYU ECNU Inst Math Sci, 567 West Yangsi Rd, Shanghai 200126, Peoples R China
Keywords
Stochastic optimal control; stochastic games; mean field games; machine learning; deep learning; MEAN-FIELD GAMES; PARTIAL-DIFFERENTIAL-EQUATIONS; MULTILAYER FEEDFORWARD NETWORKS; DISCRETE-TIME APPROXIMATION; MARKOV DECISION-PROCESSES; SEMI-LAGRANGIAN SCHEME; DEEP NEURAL-NETWORKS; NUMERICAL-METHODS; FICTITIOUS PLAY; POLICY GRADIENT
DOI
10.3934/naco.2024031
Chinese Library Classification number
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
Stochastic optimal control and games have a wide range of applications, from finance and economics to social sciences, robotics, and energy management. Many real-world applications involve complex models that have driven the development of sophisticated numerical methods. Recently, computational methods based on machine learning have been developed for solving stochastic control problems and games. In this review, we focus on deep learning methods that have unlocked the possibility of solving such problems, even in high dimensions or when the structure is very complex, beyond what traditional numerical methods can achieve. We consider mostly the continuous time and continuous space setting. Many of the new approaches build on recent neural-network-based methods for solving high-dimensional partial differential equations or backward stochastic differential equations, or on model-free reinforcement learning for Markov decision processes that have led to breakthrough results. This paper provides an introduction to these methods and summarizes the state-of-the-art works at the crossroads of machine learning and stochastic control and games.
Pages: 435-525
Number of pages: 91