Robust optimal subsampling based on weighted asymmetric least squares

被引:2
|
作者
Ren, Min [1 ]
Zhao, Shengli [1 ]
Wang, Mingqiu [1 ]
Zhu, Xinbei [2 ]
机构
[1] Qufu Normal Univ, Sch Stat & Data Sci, Qufu 273165, Shandong, Peoples R China
[2] Virginia Tech Univ, Dept Comp Sci, Blacksburg, VA 24061 USA
基金
中国国家自然科学基金;
关键词
Asymmetric least squares; Massive data; Poisson subsampling; Robustness; REGRESSION;
D O I
10.1007/s00362-023-01480-7
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
With the development of contemporary science, a large amount of generated data includes heterogeneity and outliers in the response and/or covariates. Furthermore, subsampling is an effective method to overcome the limitation of computational resources. However, when data include heterogeneity and outliers, incorrect subsampling probabilities may select inferior subdata, and statistic inference on this subdata may have a far inferior performance. Combining the asymmetric least squares and L2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$L_2$$\end{document} estimation, this paper proposes a double-robustness framework (DRF), which can simultaneously tackle the heterogeneity and outliers in the response and/or covariates. The Poisson subsampling is implemented based on the DRF for massive data, and a more robust probability will be derived to select the subdata. Under some regularity conditions, we establish the asymptotic properties of the subsampling estimator based on the DRF. Numerical studies and actual data demonstrate the effectiveness of the proposed method.
引用
收藏
页码:2221 / 2251
页数:31
相关论文
共 50 条
  • [31] Robust least squares support vector machine based on recursive outlier elimination
    Wen, Wen
    Hao, Zhifeng
    Yang, Xiaowei
    SOFT COMPUTING, 2010, 14 (11) : 1241 - 1251
  • [32] Dempster Shafer theory-based robust least squares support vector machine for stochastic modelling
    Zhou, Chuang
    Lu, Xinjiang
    Huang, MingHui
    NEUROCOMPUTING, 2016, 182 : 145 - 153
  • [33] Robust least squares support vector machine based on recursive outlier elimination
    Wen Wen
    Zhifeng Hao
    Xiaowei Yang
    Soft Computing, 2010, 14 : 1241 - 1251
  • [34] Robust Least Squares Projection Twin SVM and its Sparse Solution
    Zhou, Shuisheng
    Zhang, Wenmeng
    Chen, Li
    Xu, Mingliang
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2023, 34 (04) : 827 - 838
  • [35] Asymmetric least squares for multiple spectra baseline correction
    Peng, Jiangtao
    Peng, Silong
    Jiang, An
    Wei, Jiping
    Li, Changwen
    Tan, Jie
    ANALYTICA CHIMICA ACTA, 2010, 683 (01) : 63 - 68
  • [36] Robust spectrotemporal decomposition by iteratively reweighted least squares
    Ba, Demba
    Babadi, Behtash
    Purdon, Patrick L.
    Brown, Emery N.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2014, 111 (50) : E5336 - E5345
  • [37] A Nonlinear Robust Partial Least Squares Method with Application
    Jia, Runda
    Mao, Zhizhong
    Chang, Yuqing
    2010 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-5, 2010, : 2334 - 2339
  • [38] Weighted Least Squares Regression with the Best Robustness and High Computability
    Zuo, Yijun
    Zuo, Hanwen
    AXIOMS, 2024, 13 (05)
  • [39] On dealing with censored largest observations under weighted least squares
    Khan, Md Hasinur Rahaman
    Shaw, J. Ewart H.
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2016, 86 (18) : 3758 - 3776
  • [40] Robust image watermarking scheme based on subsampling
    Fu, YG
    Shen, RM
    Shen, LP
    THIRD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS, VOL 2, PROCEEDINGS, 2005, : 361 - 365