Nonasymptotic analysis of Stochastic Gradient Hamiltonian Monte Carlo under local conditions for nonconvex optimization

被引：0

作者：

Akyildiz, O. Deniz ^{[1
]}

Sabanis, Sotirios ^{[2
,3
]}

机构：

[1] Imperial Coll London, Dept Math, London, England

[2] Univ Edinburgh, Sch Math, Edinburgh, Scotland

[3] Natl Tech Univ Athens, Alan Turing Inst, Athens, Greece

来源：

JOURNAL OF MACHINE LEARNING RESEARCH | 2024年 / 25卷

基金：

英国工程与自然科学研究理事会;

关键词：

Non-convex optimization; underdamped Langevin Monte Carlo; non-log-concave sampling; nonasmyptotic bounds; global optimization; DEPENDENT DATA STREAMS; LANGEVIN DYNAMICS; CONVERGENCE;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We provide a nonasymptotic analysis of the convergence of the stochastic gradient Hamiltonian Monte Carlo (SGHMC) to a target measure in Wasserstein-2 distance without assuming log-concavity. Our analysis quantifies key theoretical properties of the SGHMC as a sampler under local conditions which significantly improves the findings of previous results. In particular, we prove that the Wasserstein-2 distance between the target and the law of the SGHMC is uniformly controlled by the step-size of the algorithm, therefore demonstrate that the SGHMC can provide high-precision results uniformly in the number of iterations. The analysis also allows us to obtain nonasymptotic bounds for nonconvex optimization problems under local conditions and implies that the SGHMC, when viewed as a nonconvex optimizer, converges to a global minimum with the best known rates. We apply our results to obtain nonasymptotic bounds for scalable Bayesian inference and nonasymptotic generalization bounds.

引用

页数：34

共 42 条

[31]

Mousavi-Hosseini A, 2023, Arxiv, DOI arXiv:2303.03589

[32]

Raginsky Maxim, 2017, 2017 C LEARNING THEO, P1674

[33] Langevin Diffusions and Metropolis-Hastings Algorithms [J].

G. O. Roberts ;

O. Stramer .

Methodology And Computing In Applied Probability, 2002, 4 (4) :337-357

[34]

ROBERTS G. O., 1996, Bernoulli, V2, P341

[35]

SDalalyan A, 2018, Arxiv, DOI arXiv:1807.09382

[36] Higher order Langevin Monte Carlo algorithm [J].

Sabanis, Sotirios ;

Zhang, Ying .

ELECTRONIC JOURNAL OF STATISTICS, 2019, 13 (02) :3805-3850

[37]

Vempala SS, 2019, ADV NEUR IN, V32

[38]

Welling M., 2011, P 28 INT C INT C MAC, P681, DOI [DOI 10.5555/310448, DOI 10.5555/3104482.3104568]

[39]

Xu P., 2018, Advances in Neural Information Processing Systems, V31, P3122

[40]

Zhang MS, 2023, PR MACH LEARN RES, V195, P36

← 1 2 3 4 5 →