Robust partially linear models for automatic structure discovery

被引:1
作者
Han, Yuxiang [1 ]
Chen, Hong [1 ,2 ,3 ]
Gong, Tieliang [4 ]
Cai, Jia [5 ]
Deng, Hao [1 ]
机构
[1] Huazhong Agr Univ, Coll Sci, Wuhan 430070, Peoples R China
[2] Minist Educ, Engn Res Ctr Intelligent Technol Agr, Wuhan 430070, Peoples R China
[3] Hubei Engn Technol Res Ctr Agr Big Data, Wuhan 430070, Peoples R China
[4] Xi An Jiao Tong Univ, Sch Comp Sci & Technol, Xian 710049, Peoples R China
[5] Guangdong Univ Finance & Econ, Sch Digital Econ, Guangzhou 510320, Peoples R China
关键词
Learning theory; Partially linear models; Kernel methods; Modal regression; Structure discovery; MAXIMUM-LIKELIHOOD-ESTIMATION; REGRESSION SHRINKAGE; VARIABLE SELECTION;
D O I
10.1016/j.eswa.2023.119528
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Partially linear models (PLMs), rooted in the combination of linear and nonlinear approximation, are recognized to be capable of modeling complex data. Indeed, the performance of PLMs depends heavily on the choice of model structure, such as which covariates have linear or nonlinear effects on the response. Nevertheless, most existing PLMs are limited to the mean regression, resulting in sensitivity to non-Gaussian noises, such as skewed noise and heavy-tailed noise. In order to mitigate the influence of noise in structure discovery, this paper proposes a Robust Linear And Nonlinear Discovery algorithm (RLAND) by integrating the modal regression and PLMs. Statistical analysis on generalization bound and structure discovery consistency are established to characterize its learning theory foundations. Computation analysis illustrates that the RLAND can be efficiently realized by half quadratic optimization and the quadratic programming. Empirical evaluations on simulation and real-world data validate the competitive performance of the proposed method.
引用
收藏
页数:14
相关论文
共 39 条
[1]  
Bartlett P. L., 2003, Journal of Machine Learning Research, V3, P463, DOI 10.1162/153244303321897690
[2]   Sparse Modal Additive Model [J].
Chen, Hong ;
Wang, Yingjie ;
Zheng, Feng ;
Deng, Cheng ;
Huang, Heng .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (06) :2373-2387
[3]  
Chen H, 2017, ADV NEUR IN, V30
[4]   NONPARAMETRIC MODAL REGRESSION [J].
Chen, Yen-Chi ;
Genovese, Christopher R. ;
Tibshirani, Ryan J. ;
Wasserman, Larry .
ANNALS OF STATISTICS, 2016, 44 (02) :489-514
[5]   An APN model for Arrhythmic beat classification [J].
Chiang, Hsiu-Sen ;
Shih, Dong-Her ;
Lin, Binshan ;
Shih, Ming-Hung .
BIOINFORMATICS, 2014, 30 (12) :1739-1746
[6]   A NOTE ON PREDICTION VIA ESTIMATION OF THE CONDITIONAL MODE FUNCTION [J].
COLLOMB, G ;
HARDLE, W ;
HASSANI, S .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 1987, 15 (02) :227-236
[7]  
Cucker F, 2007, C MO AP C M, P1, DOI 10.1017/CBO9780511618796
[8]  
Dascalu M, 2020, ROM J INF SCI TECH, V23, pS15
[9]   Modelling beyond regression functions: an application of multimodal regression to speed-flow data [J].
Einbeck, Jochen ;
Tutz, Gerhard .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2006, 55 :461-475
[10]   SEMIPARAMETRIC ESTIMATES OF THE RELATION BETWEEN WEATHER AND ELECTRICITY SALES [J].
ENGLE, RF ;
GRANGER, CWJ ;
RICE, J ;
WEISS, A .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1986, 81 (394) :310-320