Optimization by linear kinetic equations and mean-field Langevin dynamics

被引:0
|
作者
Pareschi, Lorenzo [1 ,2 ,3 ]
机构
[1] Heriot Watt Univ, Maxwell Inst Math Sci, Edinburgh EH14 4AP, Scotland
[2] Heriot Watt Univ, Dept Math, Edinburgh, Scotland
[3] Univ Ferrara, Dept Math & Comp Sci, Via Machiavelli 30, I-44123 Ferrara, Italy
来源
关键词
Simulated annealing; global optimization; linear Boltzmann equation; entropy inequalities; mean-field Langevin dynamics; stochastic gradient descent; sampling; CONSENSUS-BASED OPTIMIZATION; SIMULATED ANNEALING ALGORITHMS; GLOBAL OPTIMIZATION; WEAK-CONVERGENCE; INEQUALITIES; DIFFUSION;
D O I
10.1142/S0218202524500428
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
One of the most striking examples of the close connections between global optimization processes and statistical physics is the simulated annealing method, inspired by the famous Monte Carlo algorithm devised by Metropolis et al. in the middle of last century. In this paper, we show how the tools of linear kinetic theory allow the description of this gradient-free algorithm from the perspective of statistical physics and how convergence to the global minimum can be related to classical entropy inequalities. This analysis highlights the strong link between linear Boltzmann equations and stochastic optimization methods governed by Markov processes. Thanks to this formalism, we can establish the connections between the simulated annealing process and the corresponding mean-field Langevin dynamics characterized by a stochastic gradient descent approach. Generalizations to other selection strategies in simulated annealing that avoid the acceptance-rejection dynamic are also provided.
引用
收藏
页码:2191 / 2216
页数:26
相关论文
共 50 条
  • [1] ERGODICITY OF THE UNDERDAMPED MEAN-FIELD LANGEVIN DYNAMICS
    Kazeykina, Anna
    Ren, Zhenjie
    Tan, Xiaolu
    Yang, Junjian
    ANNALS OF APPLIED PROBABILITY, 2024, 34 (03): : 3181 - 3226
  • [2] Mean-field Langevin dynamics and energy landscape of neural networks
    Hu, Kaitong
    Ren, Zhenjie
    Siska, David
    Szpruch, Lukasz
    ANNALES DE L INSTITUT HENRI POINCARE-PROBABILITES ET STATISTIQUES, 2021, 57 (04): : 2043 - 2065
  • [3] MEAN-FIELD AND KINETIC DESCRIPTIONS OF NEURAL DIFFERENTIAL EQUATIONS
    Herty, Michael
    Trimborn, Torsten
    Visconti, Giuseppe
    FOUNDATIONS OF DATA SCIENCE, 2022, 4 (02): : 271 - 298
  • [4] KINETIC MEAN-FIELD THEORIES
    KARKHECK, J
    STELL, G
    JOURNAL OF CHEMICAL PHYSICS, 1981, 75 (03): : 1475 - 1487
  • [5] Variance Optimization and Control Regularity for Mean-Field Dynamics
    Bonnet, Benoit
    Rossi, Francesco
    IFAC PAPERSONLINE, 2021, 54 (19): : 13 - 18
  • [6] Sparse kinetic Jurdjevic-Quinn control for mean-field equations
    Piccoli, Benedetto
    Rossi, Francesco
    Trelat, Emmanuel
    2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 3960 - 3965
  • [7] Feature learning via mean-field Langevin dynamics: classifying sparse parities and beyond
    Suzuki, Taiji
    Wu, Denny
    Oko, Kazusato
    Nitanda, Atsushi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [8] Exact controllability of linear mean-field stochastic systems and observability inequality for mean-field backward stochastic differential equations
    Ye, Wenjie
    Yu, Zhiyong
    ASIAN JOURNAL OF CONTROL, 2022, 24 (01) : 237 - 248
  • [9] Linear-Quadratic Delayed Mean-Field Social Optimization
    Nie, Tianyang
    Wang, Shujun
    Wu, Zhen
    APPLIED MATHEMATICS AND OPTIMIZATION, 2024, 89 (01):
  • [10] Linear-Quadratic Delayed Mean-Field Social Optimization
    Tianyang Nie
    Shujun Wang
    Zhen Wu
    Applied Mathematics & Optimization, 2024, 89