ON THE EFFICIENCY OF ONLINE APPROACH TO NONPARAMETRIC SMOOTHING OF BIG DATA

被引:11
|
作者
Kong, Efang [1 ]
Xia, Yingcun [2 ,3 ]
机构
[1] Univ Elect Sci & Technol China, Sch Math Sci, 4,Sect 2,North Jianshe Rd, Chengdu 610054, Sichuan, Peoples R China
[2] Univ Elect Sci & Technol China, Chengdu, Sichuan, Peoples R China
[3] Natl Univ Singapore, Dept Stat & Appl Probabil, Singapore 117546, Singapore
基金
中国国家自然科学基金;
关键词
Big data; kernel density estimation; N-W estimation; online updating estimation; varying coefficient model; KERNEL DENSITY-ESTIMATION;
D O I
10.5705/ss.202015.0365
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The online updating approach (ONLINE) has been commonly used for the analysis of big data and online transient data. We consider in this paper how to improve its efficiency for various ONLINE kernel-based nonparametric estimators. Our findings include: (i) the optimal choice concerning the bandwidth and how it differs from that for the classical estimators; (ii) the optimal choice among a general class of sequential updating schemes; (iii) that the relative efficiencies of ONLINE Parzen-Rosenblatt density estimation or Nadaraya-Waston (N-W) regression estimation change with the dimension p of covariate in a nonlinear manner, and (iv) that while the classical local-linear fitting renders the estimators design-adaptive, their ONLINE counterparts still depend on the design of covariates in its leading terms of bias, they are still preferred over the ONLINE N-W estimators.
引用
收藏
页码:185 / 201
页数:17
相关论文
共 50 条
  • [32] Equivalent kernels of smoothing splines in nonparametric regression for clustered/longitudinal data
    Lin, XH
    Wang, NY
    Welsh, AH
    Carroll, RJ
    BIOMETRIKA, 2004, 91 (01) : 177 - 193
  • [33] NONPARAMETRIC INFERENCE FOR RIGHT-CENSORED DATA USING SMOOTHING SPLINES
    Hao, Meiling
    Lin, Yuanyuan
    Zhao, Xingqiu
    STATISTICA SINICA, 2020, 30 (01) : 153 - 173
  • [34] Nonparametric estimation of the log odds ratio for sparse data by kernel smoothing
    Chen, Ziqi
    Shi, Ning-Zhong
    Gao, Wei
    STATISTICS & PROBABILITY LETTERS, 2011, 81 (12) : 1802 - 1807
  • [35] A nonparametric smoothing method for assessing GEE models with longitudinal binary data
    Lin, Kuo-Chin
    Chen, Yi-Ju
    Shyr, Yu
    STATISTICS IN MEDICINE, 2008, 27 (22) : 4428 - 4439
  • [36] Job and Candidate Recommendation with Big Data Support: A Contextual Online Learning Approach
    Dong, Shaokang
    Lei, Zijian
    Zhou, Pan
    Bian, Kaigui
    Liu, Guanghui
    GLOBECOM 2017 - 2017 IEEE GLOBAL COMMUNICATIONS CONFERENCE, 2017,
  • [37] Predicting online e-marketplace sales performances: A big data approach
    Li, Boying
    Ch'ng, Eugene
    Chong, Alain Yee-Loong
    Bao, Haijun
    COMPUTERS & INDUSTRIAL ENGINEERING, 2016, 101 : 565 - 571
  • [38] An Approach of Russian Online Learning Behavior Analysis and Mining Based on Big Data
    Wang, Yiqiong
    2019 INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION, BIG DATA & SMART CITY (ICITBS), 2019, : 232 - 235
  • [39] Course Recommendation of MOOC with Big Data Support: A Contextual Online Learning Approach
    Hou, Yifan
    Zhou, Pan
    Xu, Jie
    Wu, Dapeng Oliver
    IEEE INFOCOM 2018 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2018, : 106 - 111
  • [40] Assessing learners' satisfaction in collaborative online courses through a big data approach
    Elia, Gianluca
    Solazzo, Gianluca
    Lorenzo, Gianluca
    Passiante, Giuseppina
    COMPUTERS IN HUMAN BEHAVIOR, 2019, 92 : 589 - 599