Recursive Quantile Estimation: Non-Asymptotic Confidence Bounds

被引:0
|
作者
Chen, Likai [1 ]
Keilbar, Georg [2 ]
Wu, Wei Biao [3 ]
机构
[1] Washington Univ, Dept Math & Stat, St Louis, MO 63130 USA
[2] Univ Vienna, Dept Stat & Operat Res, Vienna, Austria
[3] Univ Chicago, Dept Stat, Chicago, IL USA
关键词
Finite sample bounds; quantiles; stochastic gradient descent; Polyak-Ruppert averaging; recursive estimation; STOCHASTIC-APPROXIMATION; HILBERT-SPACES; REGRESSION; INFERENCE; RISK;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper considers the recursive estimation of quantiles using the stochastic gradient descent (SGD) algorithm with Polyak-Ruppert averaging. The algorithm offers a computationally and memory efficient alternative to the usual empirical estimator. Our focus is on studying the non-asymptotic behavior by providing exponentially decreasing tail probability bounds under mild assumptions on the smoothness of the density functions. This novel non-asymptotic result is based on a bound of the moment generating function of the SGD estimate. We apply our result to the problem of best arm identification in a multi-armed stochastic bandit setting under quantile preferences.
引用
收藏
页数:25
相关论文
共 39 条
  • [31] Growing-dimensional partially functional linear models: non-asymptotic optimal prediction error
    Zhang, Huiming
    Lei, Xiaoyu
    PHYSICA SCRIPTA, 2023, 98 (09)
  • [32] Shrinkage for covariance estimation: asymptotics, confidence intervals, bounds and applications in sensor monitoring and finance
    Ansgar Steland
    Statistical Papers, 2018, 59 : 1441 - 1462
  • [33] Shrinkage for covariance estimation: asymptotics, confidence intervals, bounds and applications in sensor monitoring and finance
    Steland, Ansgar
    STATISTICAL PAPERS, 2018, 59 (04) : 1441 - 1462
  • [34] Asymptotic normality for a non parametric estimator of conditional quantile with left-truncated data
    Yao, Mei
    Wang, Jiang-Feng
    Lin, Lu
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (13) : 6280 - 6292
  • [35] Two Time-scale Off-Policy TD Learning: Non-asymptotic Analysis over Markovian Samples
    Xu, Tengyu
    Zou, Shaofeng
    Liang, Yingbin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [36] A Non-asymptotic Risk Bound for Model Selection in a High-Dimensional Mixture of Experts via Joint Rank and Variable Selection
    TrungTin Nguyen
    Dung Ngoc Nguyen
    Hien Duy Nguyen
    Chamroukhi, Faicel
    ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2023, PT II, 2024, 14472 : 234 - 245
  • [37] Recursive non-parametric kernel classification rule estimation for independent functional data
    Slaoui, Yousri
    COMPUTATIONAL STATISTICS, 2021, 36 (01) : 79 - 112
  • [38] Protection of six-phase transmission line using recursive estimation of non-linear autoregression model coefficients and decision tree
    Althi, Tirupathi Rao
    Koley, Ebha
    Ghosh, Subhojit
    IET SCIENCE MEASUREMENT & TECHNOLOGY, 2020, 14 (10) : 931 - 942
  • [39] Residual Attention Network-Based Confidence Estimation Algorithm for Non-Holonomic Constraint in GNSS/INS Integrated Navigation System
    Xiao, Yimin
    Luo, Haiyong
    Zhao, Fang
    Wu, Fan
    Gao, Xile
    Wang, Qu
    Cui, Lizhen
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (11) : 11404 - 11418