Recursive Quantile Estimation: Non-Asymptotic Confidence Bounds

被引：0

作者：

Chen, Likai ^{[1
]}

Keilbar, Georg ^{[2
]}

Wu, Wei Biao ^{[3
]}

机构：

[1] Washington Univ, Dept Math & Stat, St Louis, MO 63130 USA

[2] Univ Vienna, Dept Stat & Operat Res, Vienna, Austria

[3] Univ Chicago, Dept Stat, Chicago, IL USA

来源：

JOURNAL OF MACHINE LEARNING RESEARCH | 2023年 / 24卷

关键词：

Finite sample bounds; quantiles; stochastic gradient descent; Polyak-Ruppert averaging; recursive estimation; STOCHASTIC-APPROXIMATION; HILBERT-SPACES; REGRESSION; INFERENCE; RISK;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper considers the recursive estimation of quantiles using the stochastic gradient descent (SGD) algorithm with Polyak-Ruppert averaging. The algorithm offers a computationally and memory efficient alternative to the usual empirical estimator. Our focus is on studying the non-asymptotic behavior by providing exponentially decreasing tail probability bounds under mild assumptions on the smoothness of the density functions. This novel non-asymptotic result is based on a bound of the moment generating function of the SGD estimate. We apply our result to the problem of best arm identification in a multi-armed stochastic bandit setting under quantile preferences.

引用

页数：25

共 39 条

[31] Growing-dimensional partially functional linear models: non-asymptotic optimal prediction error
Zhang, Huiming
Lei, Xiaoyu
PHYSICA SCRIPTA, 2023, 98 (09)
[32] Shrinkage for covariance estimation: asymptotics, confidence intervals, bounds and applications in sensor monitoring and finance
Ansgar Steland
Statistical Papers, 2018, 59 : 1441 - 1462
[33] Shrinkage for covariance estimation: asymptotics, confidence intervals, bounds and applications in sensor monitoring and finance
Steland, Ansgar
STATISTICAL PAPERS, 2018, 59 (04) : 1441 - 1462
[34] Asymptotic normality for a non parametric estimator of conditional quantile with left-truncated data
Yao, Mei
Wang, Jiang-Feng
Lin, Lu
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (13) : 6280 - 6292
[35] Two Time-scale Off-Policy TD Learning: Non-asymptotic Analysis over Markovian Samples
Xu, Tengyu
Zou, Shaofeng
Liang, Yingbin
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[36] A Non-asymptotic Risk Bound for Model Selection in a High-Dimensional Mixture of Experts via Joint Rank and Variable Selection
TrungTin Nguyen
Dung Ngoc Nguyen
Hien Duy Nguyen
Chamroukhi, Faicel
ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2023, PT II, 2024, 14472 : 234 - 245
[37] Recursive non-parametric kernel classification rule estimation for independent functional data
Slaoui, Yousri
COMPUTATIONAL STATISTICS, 2021, 36 (01) : 79 - 112
[38] Protection of six-phase transmission line using recursive estimation of non-linear autoregression model coefficients and decision tree
Althi, Tirupathi Rao
Koley, Ebha
Ghosh, Subhojit
IET SCIENCE MEASUREMENT & TECHNOLOGY, 2020, 14 (10) : 931 - 942
[39] Residual Attention Network-Based Confidence Estimation Algorithm for Non-Holonomic Constraint in GNSS/INS Integrated Navigation System
Xiao, Yimin
Luo, Haiyong
Zhao, Fang
Wu, Fan
Gao, Xile
Wang, Qu
Cui, Lizhen
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (11) : 11404 - 11418

← 1 2 3 4 →