Optimal subsampling design for polynomial regression in one covariate

被引:3
作者
Reuter, Torsten [1 ]
Schwabe, Rainer [1 ]
机构
[1] Otto von Guericke Univ, Univ Pl 2, D-39106 Magdeburg, Germany
关键词
Subdata; D-optimality; Massive data; Polynomial regression; ALGORITHMS;
D O I
10.1007/s00362-023-01425-0
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Improvements in technology lead to increasing availability of large data sets which makes the need for data reduction and informative subsamples ever more important. In this paper we construct D-optimal subsampling designs for polynomial regression in one covariate for invariant distributions of the covariate. We study quadratic regression more closely for specific distributions. In particular we make statements on the shape of the resulting optimal subsampling designs and the effect of the subsample size on the design. To illustrate the advantage of the optimal subsampling designs we examine the efficiency of uniform random subsampling.
引用
收藏
页码:1095 / 1117
页数:23
相关论文
共 50 条
[41]   D-optimal designs for combined polynomial and trigonometric regression on a partial circle [J].
Chang, Fu-Chuen ;
Li, Chin-Han .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2013, 143 (07) :1186-1194
[42]   Constrained D- and D1-optimal designs for polynomial regression [J].
Dette, H ;
Franke, T .
ANNALS OF STATISTICS, 2000, 28 (06) :1702-1727
[43]   Random perturbation subsampling for rank regression with massive data [J].
He, Sijin ;
Xia, Xiaochao .
STATISTICS AND COMPUTING, 2025, 35 (01)
[44]   OPTIMAL SUBSAMPLING ALGORITHMS FOR BIG DATA REGRESSIONS [J].
Ai, Mingyao ;
Yu, Jun ;
Zhang, Huiming ;
Wang, HaiYing .
STATISTICA SINICA, 2021, 31 (02) :749-772
[45]   Computer Aided Formula Design of PVA Abrasive Tool by Polynomial Regression [J].
Zhou, Z. Z. ;
Li, Z. ;
Yuan, J. L. .
DIGITAL DESIGN AND MANUFACTURING TECHNOLOGY II, 2011, 215 :182-185
[46]   Locally D-optimal designs for multistage models and heteroscedastic polynomial regression models [J].
Fang, Zhide ;
Wiens, Douglas P. ;
Wu, Zheyang .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2006, 136 (11) :4059-4070
[47]   A general approach to D-optimal designs for weighted univariate polynomial regression models [J].
Holger Dette ;
Matthias Trampisch .
Journal of the Korean Statistical Society, 2010, 39 :1-26
[48]   Optimal characteristic designs for polynomial models [J].
Rodríguez-Díaz, JM ;
López-Fidalgo, J .
OPTIMUM DESIGN 2000, 2001, 51 :123-130
[49]   Robust and efficient subsampling algorithms for massive data logistic regression [J].
Jin, Jun ;
Liu, Shuangzhe ;
Ma, Tiefeng .
JOURNAL OF APPLIED STATISTICS, 2024, 51 (08) :1427-1445
[50]   Statistical calibration and exact one-sided simultaneous tolerance intervals for polynomial regression [J].
Han, Yang ;
Liu, Wei ;
Bretz, Frank ;
Wan, Fang ;
Yang, Ping .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2016, 168 :90-96