Fuzzy regression functions with a noise cluster and the impact of outliers on mainstream machine learning methods in the regression setting

被引:31
作者
Chakravarty, Srinivas [1 ]
Demirhan, Haydar [1 ]
Baser, Furkan [2 ]
机构
[1] RMIT Univ, Discipline Math Sci, Sch Sci, Melbourne, Vic, Australia
[2] Ankara Univ, Fac Appl Sci, Dept Actuarial Sci, Ankara, Turkey
关键词
Artificial neural network; Fuzzy regression function; Outlier; Robustness; Support vector machine; SUPPORT VECTOR MACHINE; ALGORITHM; MODEL;
D O I
10.1016/j.asoc.2020.106535
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The presence of outliers in the dependent and/or independent features distorts predictions with machine learning techniques and may lead to erroneous conclusions. It is important to implement methods that are robust against the outliers to make reliable predictions and to know the accuracy of the existing methods when data is contaminated with outliers. The focus of this study is to propose a robust fuzzy regression functions (FRFN) approach against the outliers and evaluate the performance of the proposed and several mainstream machine learning approaches in the presence of outliers for the regression problem. The proposed FRFN approach is based on fuzzy k-means clustering with a noise cluster. We compare the accuracy of Artificial Neural Networks (ANN), Support Vector Machines (SVM) and the proposed FRFN approaches with different training algorithms kernel functions via simulated and real benchmark datasets. In total, accuracies of 36 ANN, SVM, and FRNF implementations with training algorithms and kernel and loss functions have been evaluated and compared to each other with samples containing outliers via a Monte Carlo simulation setting. It is observed in both Monte Carlo simulations and applications with benchmark dataset that FRFN with ANN trained with Bayes regularization algorithm and FRFN with SVM with Gaussian kernel outperforms the classical implementations of ANN and SVMs under the existence of outliers. The proposed noise cluster implementation considerably increases the robustness of fuzzy regression functions against outliers. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:17
相关论文
共 66 条
[1]   A partial-robust-ridge-based regression model with fuzzy predictors-responses [J].
Akbari, Mohammad Ghasem ;
Hesamian, Gholamreza .
JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2019, 351 :290-301
[2]   Robust multilayer neural network based on median neuron model [J].
Aladag, Cagdas Hakan ;
Egrioglu, Erol ;
Yolcu, Ufuk .
NEURAL COMPUTING & APPLICATIONS, 2014, 24 (3-4) :945-956
[3]  
[Anonymous], 2011, Handbook of Parametric and Nonparametric Statistical Procedures
[4]  
[Anonymous], [No title captured]
[5]  
[Anonymous], [No title captured]
[6]  
[Anonymous], 1981, PATTERN RECOGN, DOI 10.1007/978-1-4757-0450-1_3
[7]  
Araújo CA Jr, 2019, PESQUI AGROPECU BRAS, V54, DOI [10.1590/S1678-3921.pab2019.v54.00078, 10.1590/s1678-3921.pab2019.v54.00078]
[8]  
Arel-Bundock V., 2020, RDATASETS ARCH DATAS
[9]   A generalized feedforward neural network architecture for classification and regression [J].
Arulampalam, G ;
Bouzerdoum, A .
NEURAL NETWORKS, 2003, 16 (5-6) :561-568
[10]   Robust learning algorithm for multiplicative neuron model artificial neural networks [J].
Bas, Eren ;
Uslu, Vedide Rezan ;
Egrioglu, Erol .
EXPERT SYSTEMS WITH APPLICATIONS, 2016, 56 :80-88