Robust optimal subsampling based on weighted asymmetric least squares

Cited by: 2
Authors
Ren, Min [1]
Zhao, Shengli [1]
Wang, Mingqiu [1]
Zhu, Xinbei [2]
Affiliations
[1] Qufu Normal Univ, Sch Stat & Data Sci, Qufu 273165, Shandong, Peoples R China
[2] Virginia Tech Univ, Dept Comp Sci, Blacksburg, VA 24061 USA
Funding
National Natural Science Foundation of China
Keywords
Asymmetric least squares; Massive data; Poisson subsampling; Robustness; REGRESSION;
DOI
10.1007/s00362-023-01480-7
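Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification code
020208; 070103; 0714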
Abstract
With the development of contemporary science, a large amount of generated data includes heterogeneity and outliers in the response and/or covariates. Subsampling is an effective way to overcome limited computational resources. However, when the data include heterogeneity and outliers, incorrect subsampling probabilities may select inferior subdata, and statistical inference based on such subdata may perform far worse. Combining asymmetric least squares and $L_2$ estimation, this paper proposes a double-robustness framework (DRF), which can simultaneously tackle heterogeneity and outliers in the response and/or covariates. Poisson subsampling is implemented based on the DRF for massive data, and a more robust probability is derived to select the subdata. Under some regularity conditions, we establish the asymptotic properties of the subsampling estimator based on the DRF. Numerical studies and real data demonstrate the effectiveness of the proposed method.
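To make the two ingredients named in the abstract concrete, the following is a minimal Python sketch of Poisson subsampling combined with inverse-probability-weighted asymmetric least squares (expectile regression). It is not the paper's DRF: it omits the $L_2$ estimation component, and the second-step probabilities are a generic gradient-norm choice assumed purely for illustration, not the robust probabilities the paper derives. All function names, `tau`, `r0`, and `r` are hypothetical.

```python
import numpy as np


def expectile_fit(X, y, tau=0.5, weights=None, n_iter=50, tol=1e-8):
    """Weighted asymmetric least squares, solved by iteratively reweighted LS."""
    n = X.shape[0]
    w = np.ones(n) if weights is None else np.asarray(weights, dtype=float)
    sw = np.sqrt(w)
    # Start from the weighted ordinary least squares solution.
    beta = np.linalg.lstsq(X * sw[:, None], y * sw, rcond=None)[0]
    for _ in range(n_iter):
        resid = y - X @ beta
        # Asymmetric weight: tau for positive residuals, 1 - tau otherwise.
        a = np.where(resid > 0, tau, 1.0 - tau) * w
        XtA = X.T * a
        new_beta = np.linalg.solve(XtA @ X, XtA @ y)
        converged = np.max(np.abs(new_beta - beta)) < tol
        beta = new_beta
        if converged:
            break
    return beta


def poisson_subsample(probs, rng):
    """Keep point i independently with probability probs[i]."""
    return np.flatnonzero(rng.random(probs.shape[0]) < probs)


def two_step_estimate(X, y, tau, r0, r, rng):
    """Pilot fit on a uniform subsample, then a weighted fit on a second one."""
    n = X.shape[0]
    # Step 1: uniform Poisson pilot subsample of expected size r0.
    idx0 = poisson_subsample(np.full(n, min(1.0, r0 / n)), rng)
    beta0 = expectile_fit(X[idx0], y[idx0], tau)
    # Step 2: ASSUMED illustrative probabilities, proportional to the norm of
    # each point's asymmetric-least-squares gradient at the pilot estimate.
    resid = y - X @ beta0
    grad = np.abs(np.where(resid > 0, tau, 1.0 - tau) * resid)
    score = grad * np.linalg.norm(X, axis=1)
    probs = np.minimum(1.0, r * score / score.sum())
    idx = poisson_subsample(probs, rng)
    # Inverse-probability weights correct for the unequal inclusion chances.
    return expectile_fit(X[idx], y[idx], tau, weights=1.0 / probs[idx])
```

Because each point is kept or dropped independently, Poisson subsampling needs only one pass over the data and yields a random subsample size with expectation at most `r`. Note that a gradient-norm score like the one assumed here tends to favor points with large residuals, which is exactly the sensitivity to outliers that the abstract says motivates a more robust probability.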
Pages: 2221-2251
Page count: 31
Related papers
50 in total
  • [1] A Robust Pruning Method for Reduced Weighted Least Squares Support Vector Machine
    Si GangQuan
    Guo Zhang
    Shi JianQuan
    PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC), 2016, : 3576 - 3581
  • [2] Asymmetric and robust loss function driven least squares support vector machine
    Zhao, Xiaoxi
    Fu, Saiji
    Tian, Yingjie
    Zhao, Kun
    KNOWLEDGE-BASED SYSTEMS, 2022, 258
  • [3] Robust and Efficient Weighted Least Squares Adjustment of Relative Gravity Data
    Touati, F.
    Kahlouche, S.
    Idres, M.
    GRAVITY, GEOID AND EARTH OBSERVATION, 2010, 135 : 59 - 65
  • [4] REPRESENTATION OF THE LEAST WEIGHTED SQUARES
    Visek, Jan Amos
    ADVANCES AND APPLICATIONS IN STATISTICS, 2015, 47 (02) : 91 - 144
  • [5] Penalized Weighted Least Squares to Small Area Estimation
    Zhu, Rong
    Zou, Guohua
    Liang, Hua
    Zhu, Lixing
    SCANDINAVIAN JOURNAL OF STATISTICS, 2016, 43 (03) : 736 - 756
  • [6] An Improved Injection Model for Pansharpening Based on Weighted Least Squares
    Shi, Yan
    Wang, Wei
    Tan, Aiyong
    2021 14TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2021), 2021,
  • [7] A kernel-based weight-setting method in robust weighted least squares support vector regression
    Wen, Wen
    Hao, Zhi-Feng
    Shao, Zhuang-Feng
    Yang, Xiao-Wei
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 4206 - +
  • [8] Asymmetric least squares support vector machine classifiers
    Huang, Xiaolin
    Shi, Lei
    Suykens, Johan A. K.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 70 : 395 - 405
  • [9] Doubly-Robust Dynamic Treatment Regimen Estimation Via Weighted Least Squares
    Wallace, Michael P.
    Moodie, Erica E. M.
    BIOMETRICS, 2015, 71 (03) : 636 - 644
  • [10] On the equivalence of the weighted least squares and the generalised least squares estimators, with applications to kernel smoothing
    Luati, Alessandra
    Proietti, Tommaso
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2011, 63 (04) : 851 - 871