Generalized and Robust Least Squares Regression

被引:7
作者
Wang, Jingyu [1 ,2 ]
Xie, Fangyuan [1 ,2 ]
Nie, Feiping [1 ,3 ]
Li, Xuelong [1 ,3 ]
机构
[1] Northwestern Polytech Univ, Sch Artificial Intelligence OPt & Elect iOPEN, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ, Sch Astronaut, Xian 710072, Peoples R China
[3] Northwestern Polytech Univ, Key Lab Intelligent Interact & Applicat, Minist Ind & Informat Technol, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
Optimization; Robustness; Feature extraction; Sparse matrices; Noise measurement; Task analysis; Iterative methods; Classification; feature selection (FS); generalized framework; linear regression; robust; NEURAL-NETWORK APPROACH; NEURODYNAMIC APPROACH; BOLTZMANN MACHINES; PARALLEL ALGORITHM; GENETIC ALGORITHM; LOCAL SEARCH; LARGE-SCALE; OPTIMIZATION; MODEL; COMBINATORIAL;
D O I
10.1109/TNNLS.2022.3213594
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a simple yet effective method, least squares regression (LSR) is extensively applied for data regression and classification. Combined with sparse representation, LSR can be extended to feature selection (FS) as well, in which $\ell _{1}$ regularization is often applied in embedded FS algorithms. However, because the loss function is in the form of squared error, LSR and its variants are sensitive to noises, which significantly degrades the effectiveness and performance of classification and FS. To cope with the problem, we propose a generalized and robust LSR (GRLSR) for classification and FS, which is made up of arbitrary concave loss function and the l(2,p)-norm regularization term. Meanwhile, an iterative algorithm is applied to efficiently deal with the nonconvex minimization problem, in which an additional weight to suppress the effect of noises is added to each data point. The weights can be automatically assigned according to the error of the samples. When the error is large, the value of the corresponding weight is small. It is this mechanism that allows GRLSR to reduce the impact of noises and outliers. According to the different formulations of the concave loss function, four specific methods are proposed to clarify the essence of the framework. Comprehensive experiments on corrupted datasets have proven the advantage of the proposed method.
引用
收藏
页码:7006 / 7020
页数:15
相关论文
共 113 条
[1]   BOLTZMANN MACHINES FOR TRAVELING SALESMAN PROBLEMS [J].
AARTS, EHL ;
KORST, JHM .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1989, 39 (01) :79-95
[2]  
AARTS EHL, 1991, ALGORITHMICA, V6, P437, DOI 10.1007/BF01759053
[3]   Greedy random adaptive memory programming search for the capacitated clustering problem [J].
Ahmadi, S ;
Osman, IH .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2005, 162 (01) :30-44
[4]   Constrained Clustering for the Capacitated Vehicle Routing Problem (CC-CVRP) [J].
Alesiani, Francesco ;
Ermis, Gulcin ;
Gkiotsalitis, Konstantinos .
APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
[5]   OPTIMIZATION NEURAL NETWORKS FOR THE SEGMENTATION OF MAGNETIC-RESONANCE IMAGES [J].
AMARTUR, SC ;
PIRAINO, D ;
TAKEFUJI, Y .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 1992, 11 (02) :215-220
[6]  
[Anonymous], 2000, P AAAI IAAI JUN
[7]  
[Anonymous], 1979, Comput. Intractability: A Guide to Theory NP-Completeness
[8]   MEASURING POWER OF HIERARCHICAL CLUSTER-ANALYSIS [J].
BAKER, FB ;
HUBERT, LJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1975, 70 (349) :31-38
[9]  
Ball, 1965, ISODATA NOVEL METHOD
[10]   Frequency-sensitive competitive learning for scalable balanced clustering on high-dimensional hyperspheres [J].
Banerjee, A ;
Ghosh, J .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2004, 15 (03) :702-719