Fuzziness in data analysis: Towards accuracy and robustness

被引:18
作者
Colubi, Ana [1 ]
Gonzalez-Rodriguez, Gil [1 ]
机构
[1] Univ Oviedo, Dept Stat & OR, INDUROT, Mieres 3600, Spain
关键词
Fuzzy methods; Fuzzy data; Fuzziness; Randomness; Statistics; Robust data analysis; Trimming; LINEAR-REGRESSION MODEL; FUZZY RANDOM-VARIABLES; MEANS CLUSTERING MODEL; STATISTICAL-ANALYSIS; ALGORITHMS; VARIANCE; INFORMATION; COMPACT;
D O I
10.1016/j.fss.2015.05.007
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The first aim is to emphasize the use of fuzziness in data analysis to capture information that has been traditionally disregarded with a cost in the precision of the conclusions. Fuzziness can be considered in the data analysis process at various stages, but the main target in this paper will be fuzziness in the data. Depending on the nature of the fuzzy data or the aim to which they are handled, different approaches should be applied. We attempt to contribute to the clarification of such a difference while focusing on the so-called ontic approach in contrast to the epistemic approach. The second aim is to underline the need of considering robust methods to reduce the misleading impact of outliers in fuzzy data analysis. We propose trimming as a general and intuitive method to discard outliers. We exemplify this approach with the case of the ontic fuzzy trimmed mean/variance and highlight the differences with the epistemic case. All the discussions and developments are illustrated by means of a case-study concerning the perception of lengths of men and women. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:260 / 271
页数:12
相关论文
共 57 条
[1]   Testing linear independence in linear models with interval-valued data [J].
Angeles Gil, Maria ;
Gonzalez-Rodriguez, Gil ;
Colubi, Ana ;
Montenegro, Manuel .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 51 (06) :3002-3015
[2]  
[Anonymous], 2018, Robust Statistics: Theory and Methods
[3]  
[Anonymous], Pattern Recognition with Fuzzy Objective Function Algorithms,, DOI 10.1007/978-1-4757-0450-1_3
[4]   Multiple regression with fuzzy data [J].
Bargiela, Andrzej ;
Pedrycz, Witold ;
Nakashima, Tomoharu .
FUZZY SETS AND SYSTEMS, 2007, 158 (19) :2169-2188
[5]   One-sample tests for a generalized Fr,chet variance of a fuzzy random variable [J].
Belen Ramos-Guajardo, Ana ;
Colubi, Ana ;
Gonzalez-Rodriguez, Gil ;
Angeles Gil, Maria .
METRIKA, 2010, 71 (02) :185-202
[6]   A revisited approach to linear fuzzy regression using trapezoidal fuzzy intervals [J].
Bisserier, Amory ;
Boukezzoula, Reda ;
Galichet, Sylvie .
INFORMATION SCIENCES, 2010, 180 (19) :3653-3673
[7]  
Blanco-Fernández A, 2014, INT J APPROX REASON, V55, P1487, DOI 10.1016/j.ijar.2013.09.020
[8]   A set arithmetic-based linear regression model for modelling interval-valued responses through real-valued variables [J].
Blanco-Fernandez, Angela ;
Colubi, Ana ;
Garcia-Barzana, Marta .
INFORMATION SCIENCES, 2013, 247 :109-122
[9]   On the formalization of fuzzy random variables [J].
Colubi, A ;
Dominguez-Menchero, JS ;
López-Díaz, M ;
Ralescu, DA .
INFORMATION SCIENCES, 2001, 133 (1-2) :3-6
[10]  
Colubi A., 2007, METRON INT J STAT, VLXV, P277