Self-adaptive robust nonlinear regression for unknown noise via mixture of Gaussians

被引:20
作者
Wang, Haibo [1 ]
Wang, Yun [2 ]
Hu, Qinghua [2 ]
机构
[1] Hubei Univ Technol, Sch Econ & Management, Wuhan, Hubei, Peoples R China
[2] Tianjin Univ, Sch Comp Sci & Technol, Tianjin, Peoples R China
关键词
Self-adaptive nonlinear regression; Unknown noise; Mixture of Gaussians; Expectation maximization; SUPPORT VECTOR REGRESSION; ANOMALY DETECTION; MODEL; STRATEGY; OUTLIERS; MACHINE; POINTS; SETS; SVM;
D O I
10.1016/j.neucom.2017.01.024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For most regression problems, the optimal regression model can be obtained by minimizing a loss function, and the selection of loss functions has great effect on the performance of the derived regression model. Squared loss is widely used in regression. It is theoretically optimal for Gaussian noise. However, real data are usually polluted by complex and unknown noise, especially in the era of big data, the noise may not be fitted well by any single distribution. To address the above problem, two novel nonlinear regression models for single-task and multi-task problems are developed in this work, where the noise is fitted by Mixture of Gaussians. It was proved that any continuous distributions can be approximated by Mixture of Gaussians. To obtain the optimal parameters in the proposed models, an iterative algorithm based on Expectation Maximization is designed. The proposed models turn to be a self-adaptive robust nonlinear regression models. The experimental results on synthetic and real-world benchmark datasets show that the proposed models produce good performance compared with current regression algorithms and provide superior robustness.
引用
收藏
页码:274 / 286
页数:13
相关论文
共 47 条
[1]  
[Anonymous], 1988, Mixture Models: Inference and Applications to Clustering
[2]  
[Anonymous], 2005, APPL LINEAR REGRESSI
[3]  
ANZAI Y, 1989, PATTERN RECOGNITION
[4]   A multi-step outlier-based anomaly detection approach to network-wide traffic [J].
Bhuyan, Monowar H. ;
Bhattacharyya, D. K. ;
Kalita, J. K. .
INFORMATION SCIENCES, 2016, 348 :243-271
[5]   Bayesian nonlinear regression models with scale mixtures of skew-normal distributions: Estimation and case influence diagnostics [J].
Cancho, Vicente G. ;
Dey, Dipak K. ;
Lachos, Victor ;
Andrade, Marinho G. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2011, 55 (01) :588-602
[6]   A robust weighted least squares support vector regression based on least trimmed squares [J].
Chen, Chuanfa ;
Yan, Changqing ;
Li, Yanyan .
NEUROCOMPUTING, 2015, 168 :941-946
[7]   Recursive robust least squares support vector regression based on maximum correntropy criterion [J].
Chen, Xiaobo ;
Yang, Jian ;
Liang, Jun ;
Ye, Qiaolin .
NEUROCOMPUTING, 2012, 97 :63-73
[8]  
De Brabanter K, 2009, LECT NOTES COMPUT SC, V5768, P100, DOI 10.1007/978-3-642-04274-4_11
[9]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[10]   A multivariate linear regression analysis using finite mixtures of t distributions [J].
Galimberti, Giuliano ;
Soffritti, Gabriele .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 71 :138-150