Strong optimality of kernel functional regression in $L^p$ norms with partial response variables and applications

Cited by: 0
Authors
Majid Mojirsheibani [1]
Affiliation
[1] California State University Northridge, Department of Mathematics
Keywords
Nonparametric; Functional regression; Rates of convergence; Classification; Partially observed response; Primary 62G05; Secondary 62G08;
DOI
10.1007/s00362-024-01611-8
Abstract
This paper proposes kernel-type estimators of a regression function with possibly unobservable response variables in a functional covariate setting, along with their rates of convergence in general $L^p$ norms. Here, the mechanism that causes the absence of information (in the sense of having unobservable responses) is allowed to depend on both the predictors and the response variables, which makes the problem particularly challenging in those cases where model identifiability is an issue. As an immediate byproduct of these results, we propose asymptotically optimal classification rules for the challenging problem of semi-supervised learning based on the proposed estimators. Our proposed approach involves two steps. In the first step, we construct a family of models (possibly infinite dimensional) indexed by the unknown parameter of the missing probability mechanism. In the second step, a search is carried out to find the empirically optimal member of an appropriate cover (or subclass) of the underlying family in the sense of minimizing a weighted mean squared prediction error. The main focus of the paper is the rates of almost complete convergence of the $L^p$ norms of these estimators. The issue of identifiability is also addressed. As an application of our findings, we consider the classical problem of statistical classification based on the proposed regression estimators when there are a large number of missing labels in the data.
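The two-step idea described in the abstract can be illustrated with a minimal sketch. It is not the paper's estimator: the abstract does not specify the missingness model, the kernel, or the weights, so everything below is an assumption made for illustration. The sketch assumes a logistic, exponential-tilt missingness model with a scalar parameter `gamma`, a Gaussian kernel on an $L^2$-type distance between curves sampled on a common grid, and inverse-probability weights implied by that model. The function names (`tilted_fit`, `select_gamma`) and the simulated data are hypothetical.

```python
import numpy as np

def l2_dist(curves_a, curves_b):
    """Pairwise L2-type distances between curves sampled on a common grid."""
    diff = curves_a[:, None, :] - curves_b[None, :, :]
    return np.sqrt(np.mean(diff ** 2, axis=2))

def kernel_weights(x_train, x_eval, h):
    """Gaussian kernel weights K(d(x, X_i) / h), shape (n_eval, n_train)."""
    return np.exp(-0.5 * (l2_dist(x_eval, x_train) / h) ** 2)

def tilted_fit(x_train, y, delta, x_eval, h, gamma):
    """Step 1 (assumed form): one member of a family indexed by gamma.

    Under the assumed model P(delta=1 | x, y) = 1 / (1 + exp(g(x) + gamma*y)),
    E[Y | X] = E[delta*Y | X] + E[1-delta | X] * E[delta*Y*e^{gamma Y} | X]
                                               / E[delta*e^{gamma Y} | X].
    Each conditional expectation is replaced by a kernel-weighted average.
    Returns (m_hat, psi_hat), where psi_hat estimates exp(g(x)).
    """
    k = kernel_weights(x_train, x_eval, h)
    y0 = np.where(delta == 1, y, 0.0)        # unobserved responses are ignored
    tilt = delta * np.exp(gamma * y0)
    denom = k.sum(axis=1)
    tilt_den = k @ tilt
    obs_part = k @ (delta * y0) / denom      # estimates E[delta * Y | X]
    miss_prob = k @ (1 - delta) / denom      # estimates E[1 - delta | X]
    tilt_mean = k @ (tilt * y0) / tilt_den   # tilted mean of observed Y
    psi = (k @ (1 - delta)) / tilt_den       # estimates exp(g(x))
    return obs_part + miss_prob * tilt_mean, psi

def select_gamma(x_fit, y_fit, d_fit, x_val, y_val, d_val, h, gammas):
    """Step 2 (assumed form): pick gamma from a finite grid by minimizing an
    inverse-probability-weighted squared prediction error on a held-out split."""
    best = None
    y0 = np.where(d_val == 1, y_val, 0.0)
    for g in gammas:
        m_val, psi = tilted_fit(x_fit, y_fit, d_fit, x_val, h, g)
        w = d_val * (1.0 + psi * np.exp(g * y0))   # delta / pi_gamma(x, y)
        err = np.sum(w * (y0 - m_val) ** 2) / len(y_val)
        if best is None or err < best[1]:
            best = (g, err)
    return best

# Small simulated illustration (all settings hypothetical).
rng = np.random.default_rng(0)
n, grid = 120, np.linspace(0.0, 1.0, 25)
amp = rng.normal(size=(n, 1))
curves = amp * np.sin(2 * np.pi * grid) + 0.1 * rng.normal(size=(n, grid.size))
resp = amp[:, 0] + 0.2 * rng.normal(size=n)
prob_obs = 1.0 / (1.0 + np.exp(-1.0 + 0.8 * resp))   # depends on the response
obs = (rng.uniform(size=n) < prob_obs).astype(float)
half = n // 2
gamma_hat, val_err = select_gamma(curves[:half], resp[:half], obs[:half],
                                  curves[half:], resp[half:], obs[half:],
                                  h=0.5, gammas=np.linspace(-2.0, 2.0, 9))
print("selected gamma:", gamma_hat, "weighted validation error:", val_err)
```

The finite grid of `gamma` values plays the role of the cover of the model family, and the data split keeps the selection step separate from the fitting step. When every response is observed, `tilted_fit` reduces to the ordinary kernel regression estimate regardless of `gamma`.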
Pages: 5615-5648
Page count: 33