A Technical Note on Non-Stationary Parametric Bandits: Existing Mistakes and Preliminary Solutions

被引:0
|
作者
Faury, Louis [1 ,2 ]
Russac, Yoan [3 ,4 ,5 ,6 ]
Abeille, Marc [2 ]
Calauzenes, Clement [2 ]
机构
[1] LTCI TelecomParis, Paris, France
[2] Criteo AI Lab, Paris, France
[3] ENS Paris, Paris, France
[4] Univ PSL, Paris, France
[5] CNRS, Paris, France
[6] INRIA, Lille, France
来源
关键词
Stochastic Bandits; Generalized Linear Model; Non-Stationarity;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this note(1) we identify several mistakes appearing in the existing literature on non-stationary parametric bandits. More precisely, we study Generalized Linear Bandits (GLBs) in drifting environments, where the level of non-stationarity is characterized by a general metric known as the variation-budget. Existing methods to solve such problems typically involve forgetting mechanisms, which allow for a fine balance between the learning and tracking requirements of the problem. We uncover two significant mistakes in their theoretical analysis. The first arises when bounding the tracking error suffered by forgetting mechanisms. The second emerges when considering non-linear reward models, which requires extra care to balance the learning and tracking guarantees. We introduce a geometrical assumption on the arm set, sufficient to overcome the aforementioned technical gaps and recover minimax-optimality. We also share preliminary attempts at fixing those gaps under general configurations. Unfortunately, our solution yields degraded rates (w.r.t to the horizon), which raises new open questions regarding the optimality of forgetting mechanisms in non-stationary parametric bandits.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Series Solutions of the Non-Stationary Heun Equation
    Atai, Farrokh
    Langmann, Edwin
    SYMMETRY INTEGRABILITY AND GEOMETRY-METHODS AND APPLICATIONS, 2018, 14
  • [42] A note on a non-stationary point source spatial model
    Mark D. Ecker
    Victor De Oliveira
    Hans Isakson
    Environmental and Ecological Statistics, 2013, 20 : 59 - 67
  • [43] A note on a non-stationary point source spatial model
    Ecker, Mark D.
    De Oliveira, Victor
    Isakson, Hans
    ENVIRONMENTAL AND ECOLOGICAL STATISTICS, 2013, 20 (01) : 59 - 67
  • [44] Robust non-parametric smoothing of non-stationary time series
    Grillenzoni, Carlo
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2009, 79 (04) : 379 - 393
  • [45] Parametric Synthesis of Non-Stationary Control Systems with Uncertainties and Constraints
    Garkushenko V.I.
    Russian Aeronautics, 2021, 64 (03): : 424 - 431
  • [46] NON-STATIONARY PROBABILITY OF PARAMETRIC ROLL OF SHIPS IN RANDOM SEAS
    Dostal, Leo
    Kreuzer, Edwin
    COMPUTATIONAL METHODS IN MARINE ENGINEERING V (MARINE 2013), 2013, : 772 - 780
  • [47] Estimation in semi-parametric regression with non-stationary regressors
    Chen, Jia
    Gao, Jiti
    Li, Degui
    BERNOULLI, 2012, 18 (02) : 678 - 702
  • [48] Non-Stationary Linear Bandits With Dimensionality Reduction for Large-Scale Recommender Systems
    Ghoorchian, Saeed
    Kortukov, Evgenii
    Maghsudi, Setareh
    IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2024, 5 : 548 - 558
  • [49] Implementation of Exploration in TONIC Using Non-stationary Volatile Multi-arm Bandits
    Shaha, Aditya
    Arya, Dhruv
    Tripathy, B. K.
    SOFT COMPUTING FOR PROBLEM SOLVING, SOCPROS 2018, VOL 1, 2020, 1048 : 239 - 250
  • [50] A Risk-Averse Framework for Non-Stationary Stochastic Multi-Armed Bandits
    Alami, Reda
    Mahfoud, Mohammed
    Achab, Mastane
    2023 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW 2023, 2023, : 272 - 280