A Mean-Field Optimal Control Formulation for Global Optimization

被引:11
作者
Zhang, Chi [1 ,2 ]
Taghvaei, Amirhossein [1 ,2 ]
Mehta, Prashant G. [1 ,2 ]
机构
[1] UIUC, Coordinated Sci Lab, Urbana, IL 61801 USA
[2] UIUC, Dept Mech Sci & Engn, Urbana, IL 61801 USA
基金
美国国家科学基金会;
关键词
Mean-field optimal control; global optimization; particle filter; MONTE-CARLO; FRAMEWORK;
D O I
10.1109/TAC.2018.2833060
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper is concerned with variational optimal control constructions whose solution yields a sampling algorithm. The particular form of the sampling algorithm considered here is a particle filter, designed to numerically approximate the solution to the global optimization problem. The theoretical significance of this study comes from its variational aspects. Specifically, the control input represents the solution of a mean-field-type optimal control problem. Its parametric counterpart, obtained when a parametric form of density is known a priori, is shown to be equivalent to the natural gradient algorithm. Explicit formulae for the filter are derived when the objective function is quadratic and the density is Gaussian. The optimal control construction of the particle filter is a significant departure from the classical importance sampling-resampling-based approaches.
引用
收藏
页码:282 / 289
页数:8
相关论文
共 26 条
[1]   Natural gradient works efficiently in learning [J].
Amari, S .
NEURAL COMPUTATION, 1998, 10 (02) :251-276
[2]  
[Anonymous], 2003, TOPICS OPTIMAL TRANS
[3]  
[Anonymous], 2017, THESIS
[4]  
Bakry Dominique, 2013, ANAL GEOMETRY MARKOV, V348
[5]   The Master equation in mean field theory [J].
Bensoussan, Alain ;
Frehse, Jens ;
Yam, Sheung Chi Phillip .
JOURNAL DE MATHEMATIQUES PURES ET APPLIQUEES, 2015, 103 (06) :1441-1474
[6]  
Daum F, 2016, 2016 IEEE INTERNATIONAL CONFERENCE ON MULTISENSOR FUSION AND INTEGRATION FOR INTELLIGENT SYSTEMS (MFI), P540
[7]   Exact particle flow for nonlinear filters [J].
Daum, Fred ;
Huang, Jim ;
Noushin, Arjang .
SIGNAL PROCESSING, SENSOR FUSION, AND TARGET RECOGNITION XIX, 2010, 7697
[8]  
Dullerud Geir E, 2013, A course in robust control theory: a convex approach, V36
[9]  
Durrett R., 2010, PROBABILITY THEORY E
[10]   Riemann manifold Langevin and Hamiltonian Monte Carlo methods [J].
Girolami, Mark ;
Calderhead, Ben .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2011, 73 :123-214