Combining Bayesian optimization and Lipschitz optimization

被引:0
作者
Mohamed Osama Ahmed
Sharan Vaswani
Mark Schmidt
机构
[1] University of British Columbia,
来源
Machine Learning | 2020年 / 109卷
关键词
Bayesian optimization; Global optimization; Lipschitz optimzation; Optimization;
D O I
暂无
中图分类号
学科分类号
摘要
Bayesian optimization and Lipschitz optimization have developed alternative techniques for optimizing black-box functions. They each exploit a different form of prior about the function. In this work, we explore strategies to combine these techniques for better global optimization. In particular, we propose ways to use the Lipschitz continuity assumption within traditional BO algorithms, which we call Lipschitz Bayesian optimization (LBO). This approach does not increase the asymptotic runtime and in some cases drastically improves the performance (while in the worst case the performance is similar). Indeed, in a particular setting, we prove that using the Lipschitz information yields the same or a better bound on the regret compared to using Bayesian optimization on its own. Moreover, we propose a simple heuristics to estimate the Lipschitz constant, and prove that a growing estimate of the Lipschitz constant is in some sense “harmless”. Our experiments on 15 datasets with 4 acquisition functions show that in the worst case LBO performs similar to the underlying BO method while in some cases it performs substantially better. Thompson sampling in particular typically saw drastic improvements (as the Lipschitz information corrected for its well-known “over-exploration” pheonemon) and its LBO variant often outperformed other acquisition functions.
引用
收藏
页码:79 / 102
页数:23
相关论文
共 33 条
  • [1] Bull AD(2011)Convergence rates of efficient global optimization algorithms Journal of Machine Learning Research 12 2879-2904
  • [2] Hennig P(2012)Entropy search for information-efficient global optimization Journal of Machine Learning Research 13 1809-1837
  • [3] Schuler CJ(2016)A general framework for constrained Bayesian optimization using information-based search Journal of Machine Learning Research 17 5549-5601
  • [4] Hernández-Lobato JM(2013)A literature survey of benchmark functions for global optimisation problems International Journal of Mathematical Modelling and Numerical Optimisation 4 150-194
  • [5] Gelbart MA(1993)Lipschitzian optimization without the lipschitz constant Journal of Optimization Theory and Applications 79 157-181
  • [6] Adams RP(1998)Efficient global optimization of expensive black-box functions Journal of Global optimization 13 455-492
  • [7] Hoffman MW(1964)A new method of locating the maximum point of an arbitrary multipeak curve in the presence of noise Journal of Basic Engineering 86 97-106
  • [8] Ghahramani Z(2012)An experimental methodology for response surface optimization methods Journal of Global Optimization 53 699-736
  • [9] Jamil M(1972)An algorithm for finding the absolute extremum of a function USSR Computational Mathematics and Mathematical Physics 12 57-67
  • [10] Yang XS(2013)Derivative-free optimization: A review of algorithms and comparison of software implementations Journal of Global Optimization 56 1247-1293