Convergence of a first-order consensus-based global optimization algorithm

被引:36
作者
Ha, Seung-Yeal [1 ,2 ,3 ]
Jin, Shi [4 ,5 ]
Kim, Doheon [3 ]
机构
[1] Seoul Natl Univ, Dept Math Sci, Seoul 08826, South Korea
[2] Seoul Natl Univ, Res Inst Math, Seoul 08826, South Korea
[3] Korea Inst Adv Study, Sch Math, Hoegiro 85, Seoul 02455, South Korea
[4] Shanghai Jiao Tong Univ, Sch Math Sci, Shanghai 200240, Peoples R China
[5] Shanghai Jiao Tong Univ, Inst Nat Sci, Shanghai 200240, Peoples R China
基金
新加坡国家研究基金会;
关键词
Consensus-based optimization; Gibb’ s distribution; global optimization; machine learning; objective function; FLOCKING; DYNAMICS; MODEL;
D O I
10.1142/S0218202520500463
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Global optimization of a non-convex objective function often appears in large-scale machine learning and artificial intelligence applications. Recently, consensus-based optimization (CBO) methods have been introduced as one of the gradient-free optimization methods. In this paper, we provide a convergence analysis for the first-order CBO method in [J. A. Carrillo, S. Jin, L. Li and Y. Zhu, A consensus-based global optimization method for high dimensional machine learning problems, https://arxiv.org/abs/1909.09249v1]. Prior to this work, the convergence study was carried out for CBO methods on corresponding mean-field limit, a Fokker-Planck equation, which does not imply the convergence of the CBO method per se. Based on the consensus estimate directly on the first-order CBO model, we provide a convergence analysis of the first-order CBO method [J. A. Carrillo, S. Jin, L. Li and Y. Zhu, A consensus-based global optimization method for high dimensional machine learning problems, https://arxiv.org/abs/1909.09249v1] without resorting to the corresponding mean-field model. Our convergence analysis consists of two steps. In the first step, we show that the CBO model exhibits a global consensus time asymptotically for any initial data, and in the second step, we provide a sufficient condition on system parameters - which is dimension independent - and initial data which guarantee that the converged consensus state lies in a small neighborhood of the global minimum almost surely.
引用
收藏
页码:2417 / 2444
页数:28
相关论文
共 31 条
  • [1] Aarts Emile H. L, 1987, Simulated Annealing: A Pedestrian Review of the Theory and Some Applications
  • [2] The Kuramoto model:: A simple paradigm for synchronization phenomena
    Acebrón, JA
    Bonilla, LL
    Vicente, CJP
    Ritort, F
    Spigler, R
    [J]. REVIEWS OF MODERN PHYSICS, 2005, 77 (01) : 137 - 185
  • [3] Stochastic flocking dynamics of the Cucker-Smale model with multiplicative white noises
    Ahn, Shin Mi
    Ha, Seung-Yeal
    [J]. JOURNAL OF MATHEMATICAL PHYSICS, 2010, 51 (10)
  • [4] Vehicular traffic, crowds, and swarms: From kinetic theory and multiscale methods to applications and research perspectives
    Albi, G.
    Bellomo, N.
    Fermo, L.
    Ha, S. -Y.
    Kim, J.
    Pareschi, L.
    Poyato, D.
    Soler, J.
    [J]. MATHEMATICAL MODELS & METHODS IN APPLIED SCIENCES, 2019, 29 (10) : 1901 - 2005
  • [5] Bertsekas Dimitri P., 2003, Convex Analysis and Optimization
  • [6] Carrillo J. A., CONSENSUS BASED GLOB
  • [7] Carrillo J. A., CONSENSUS BASED GLOB
  • [8] An analytical framework for consensus-based global optimization method
    Carrillo, Jose A.
    Choi, Young-Pil
    Totzeck, Claudia
    Tse, Oliver
    [J]. MATHEMATICAL MODELS & METHODS IN APPLIED SCIENCES, 2018, 28 (06) : 1037 - 1066
  • [9] Choi Y.-P., 2017, SERIES MODELING SIMU, VI
  • [10] On the mathematics of emergence
    Cucker, Felipe
    Smale, Steve
    [J]. JAPANESE JOURNAL OF MATHEMATICS, 2007, 2 (01): : 197 - 227