Convergence of a first-order consensus-based global optimization algorithm

被引:43
作者
Ha, Seung-Yeal [1 ,2 ,3 ]
Jin, Shi [4 ,5 ]
Kim, Doheon [3 ]
机构
[1] Seoul Natl Univ, Dept Math Sci, Seoul 08826, South Korea
[2] Seoul Natl Univ, Res Inst Math, Seoul 08826, South Korea
[3] Korea Inst Adv Study, Sch Math, Hoegiro 85, Seoul 02455, South Korea
[4] Shanghai Jiao Tong Univ, Sch Math Sci, Shanghai 200240, Peoples R China
[5] Shanghai Jiao Tong Univ, Inst Nat Sci, Shanghai 200240, Peoples R China
基金
新加坡国家研究基金会;
关键词
Consensus-based optimization; Gibb’ s distribution; global optimization; machine learning; objective function; FLOCKING; DYNAMICS; MODEL;
D O I
10.1142/S0218202520500463
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Global optimization of a non-convex objective function often appears in large-scale machine learning and artificial intelligence applications. Recently, consensus-based optimization (CBO) methods have been introduced as one of the gradient-free optimization methods. In this paper, we provide a convergence analysis for the first-order CBO method in [J. A. Carrillo, S. Jin, L. Li and Y. Zhu, A consensus-based global optimization method for high dimensional machine learning problems, https://arxiv.org/abs/1909.09249v1]. Prior to this work, the convergence study was carried out for CBO methods on corresponding mean-field limit, a Fokker-Planck equation, which does not imply the convergence of the CBO method per se. Based on the consensus estimate directly on the first-order CBO model, we provide a convergence analysis of the first-order CBO method [J. A. Carrillo, S. Jin, L. Li and Y. Zhu, A consensus-based global optimization method for high dimensional machine learning problems, https://arxiv.org/abs/1909.09249v1] without resorting to the corresponding mean-field model. Our convergence analysis consists of two steps. In the first step, we show that the CBO model exhibits a global consensus time asymptotically for any initial data, and in the second step, we provide a sufficient condition on system parameters - which is dimension independent - and initial data which guarantee that the converged consensus state lies in a small neighborhood of the global minimum almost surely.
引用
收藏
页码:2417 / 2444
页数:28
相关论文
共 31 条
[1]   The Kuramoto model:: A simple paradigm for synchronization phenomena [J].
Acebrón, JA ;
Bonilla, LL ;
Vicente, CJP ;
Ritort, F ;
Spigler, R .
REVIEWS OF MODERN PHYSICS, 2005, 77 (01) :137-185
[2]   Stochastic flocking dynamics of the Cucker-Smale model with multiplicative white noises [J].
Ahn, Shin Mi ;
Ha, Seung-Yeal .
JOURNAL OF MATHEMATICAL PHYSICS, 2010, 51 (10)
[3]   Vehicular traffic, crowds, and swarms: From kinetic theory and multiscale methods to applications and research perspectives [J].
Albi, G. ;
Bellomo, N. ;
Fermo, L. ;
Ha, S. -Y. ;
Kim, J. ;
Pareschi, L. ;
Poyato, D. ;
Soler, J. .
MATHEMATICAL MODELS & METHODS IN APPLIED SCIENCES, 2019, 29 (10) :1901-2005
[4]  
[Anonymous], 2003, Convex analysis and optimization
[5]  
Carrillo J. A., CONSENSUS BASED GLOB
[6]  
Carrillo J. A., CONSENSUS BASED GLOB
[7]   An analytical framework for consensus-based global optimization method [J].
Carrillo, Jose A. ;
Choi, Young-Pil ;
Totzeck, Claudia ;
Tse, Oliver .
MATHEMATICAL MODELS & METHODS IN APPLIED SCIENCES, 2018, 28 (06) :1037-1066
[8]  
Choi Y.-P., 2017, SERIES MODELING SIMU, VI
[9]   On the mathematics of emergence [J].
Cucker, Felipe ;
Smale, Steve .
JAPANESE JOURNAL OF MATHEMATICS, 2007, 2 (01) :197-227
[10]  
Ditto W, 2002, NATURE, V415, P736, DOI 10.1038/415736b