Online Non-Convex Optimization with Imperfect Feedback

被引:0
|
作者
Heliou, Amelie [1 ]
Martin, Matthieu [1 ]
Mertikopoulos, Panayotis [1 ,2 ]
Rahier, Thibaud [1 ]
机构
[1] Criteo AI Lab, Paris, France
[2] Univ Grenoble Alpes, CNRS, INRIA, LIG, Grenoble, France
关键词
ALGORITHM; REGRET;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider the problem of online learning with non-convex losses. In terms of feedback, we assume that the learner observes - or otherwise constructs - an inexact model for the loss function encountered at each stage, and we propose a mixed-strategy learning policy based on dual averaging. In this general context, we derive a series of tight regret minimization guarantees, both for the learner's static (external) regret, as well as the regret incurred against the best dynamic policy in hindsight. Subsequently, we apply this general template to the case where the learner only has access to the actual loss incurred at each stage of the process. This is achieved by means of a kernel-based estimator which generates an inexact model for each round's loss function using only the learner's realized losses as input.
引用
收藏
页数:12
相关论文
共 50 条
  • [11] An Online Method for A Class of Distributionally Robust Optimization with Non-Convex Objectives
    Qi, Qi
    Guo, Zhishuai
    Xu, Yi
    Jin, Rong
    Yang, Tianbao
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [12] Optimal, Stochastic, Non-smooth, Non-convex Optimization through Online-to-Non-convex Conversion
    Cutkosky, Ashok
    Mehta, Harsh
    Orabona, Francesco
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [13] Natasha: Faster Non-Convex Stochastic Optimization via Strongly Non-Convex Parameter
    Allen-Zhu, Zeyuan
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [14] Network localization by non-convex optimization
    Saha, Ananya
    Sau, Buddhadeb
    MOBIMWAREHN'17: PROCEEDINGS OF THE 7TH ACM WORKSHOP ON MOBILITY, INTERFERENCE, AND MIDDLEWARE MANAGEMENT IN HETNETS, 2017,
  • [15] Managing systems with non-convex positive feedback
    Brock, WA
    Starrett, D
    ENVIRONMENTAL & RESOURCE ECONOMICS, 2003, 26 (04): : 575 - 602
  • [16] Gradient Methods for Non-convex Optimization
    Jain, Prateek
    JOURNAL OF THE INDIAN INSTITUTE OF SCIENCE, 2019, 99 (02) : 247 - 256
  • [17] Parallel continuous non-convex optimization
    Holmqvist, K
    Migdalas, A
    Pardalos, PM
    PARALLEL COMPUTING IN OPTIMIZATION, 1997, 7 : 471 - 527
  • [18] Gradient Methods for Non-convex Optimization
    Prateek Jain
    Journal of the Indian Institute of Science, 2019, 99 : 247 - 256
  • [19] Replica Exchange for Non-Convex Optimization
    Dong, Jing
    Tong, Xin T.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
  • [20] Replica exchange for non-convex optimization
    Dong, Jing
    Tong, Xin T.
    1600, Microtome Publishing (22):