ON THE EFFICIENCY OF ONLINE APPROACH TO NONPARAMETRIC SMOOTHING OF BIG DATA

被引:11
|
作者
Kong, Efang [1 ]
Xia, Yingcun [2 ,3 ]
机构
[1] Univ Elect Sci & Technol China, Sch Math Sci, 4,Sect 2,North Jianshe Rd, Chengdu 610054, Sichuan, Peoples R China
[2] Univ Elect Sci & Technol China, Chengdu, Sichuan, Peoples R China
[3] Natl Univ Singapore, Dept Stat & Appl Probabil, Singapore 117546, Singapore
基金
中国国家自然科学基金;
关键词
Big data; kernel density estimation; N-W estimation; online updating estimation; varying coefficient model; KERNEL DENSITY-ESTIMATION;
D O I
10.5705/ss.202015.0365
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The online updating approach (ONLINE) has been commonly used for the analysis of big data and online transient data. We consider in this paper how to improve its efficiency for various ONLINE kernel-based nonparametric estimators. Our findings include: (i) the optimal choice concerning the bandwidth and how it differs from that for the classical estimators; (ii) the optimal choice among a general class of sequential updating schemes; (iii) that the relative efficiencies of ONLINE Parzen-Rosenblatt density estimation or Nadaraya-Waston (N-W) regression estimation change with the dimension p of covariate in a nonlinear manner, and (iv) that while the classical local-linear fitting renders the estimators design-adaptive, their ONLINE counterparts still depend on the design of covariates in its leading terms of bias, they are still preferred over the ONLINE N-W estimators.
引用
收藏
页码:185 / 201
页数:17
相关论文
共 50 条
  • [1] An approach to nonparametric smoothing techniques for regressions with discrete data
    Mukhopadhyay, K
    Marsh, LC
    APPLIED ECONOMICS, 2006, 38 (03) : 301 - 305
  • [2] A Nonparametric Adaptive Sampling Strategy for Online Monitoring of Big Data Streams
    Xian, Xiaochen
    Wang, Andi
    Liu, Kaibo
    TECHNOMETRICS, 2018, 60 (01) : 14 - 25
  • [3] A Nonparametric Adaptive Sampling Strategy for Online Monitoring of Big Data Streams
    Xian, Xiaochen
    Wang, Andi
    Liu, Kaibo
    2017 13TH IEEE CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2017, : 844 - 846
  • [4] Joint modeling of multivariate nonparametric longitudinal data and survival data: A local smoothing approach
    You, Lu
    Qiu, Peihua
    STATISTICS IN MEDICINE, 2021, 40 (29) : 6689 - 6706
  • [5] A NONPARAMETRIC APPROACH TO SMOOTHING BY AGGREGATION OVER PREFERENCES
    ALLEN, B
    JOURNAL OF MATHEMATICAL ECONOMICS, 1982, 10 (2-3) : 279 - 297
  • [6] An online approach for feature selection for classification in big data
    Nazar, Nasrin Banu
    Senthilkumar, Radha
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2017, 25 (01) : 163 - 171
  • [7] A Scalable Approach for Online Hierarchical Big Data Mining
    Vanli, N. Denizcan
    Sayin, Muhammed O.
    Delibalta, Ibrahim
    Kozat, Suleyman S.
    2015 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2015, 2015, : 1 - 8
  • [8] Nonparametric regression for interval-valued data based on local linear smoothing approach
    Kong, Lingtao
    Song, Xiangjun
    Wang, Xiaomin
    NEUROCOMPUTING, 2022, 501 : 834 - 843
  • [9] Measuring Surveillance In Online Advertising: A Big Data Approach
    Herps, Aaron
    Watters, Paul A.
    Pineda-Villavicencio, Guillermo
    2013 FOURTH CYBERCRIME AND TRUSTWORTHY COMPUTING WORKSHOP (CTC 2013), 2014, : 30 - +
  • [10] Three sides of smoothing: Categorical data smoothing, nonparametric regression, and density estimation
    Simonoff, JS
    MINING AND MODELING MASSIVE DATA SETS IN SCIENCE, ENGINEERING, AND BUSINESS WITH A SUBTHEME IN ENVIRONMENTAL STATISTICS, 1997, 29 (01): : 509 - 518