Fuzziness in data analysis: Towards accuracy and robustness

被引:18
作者
Colubi, Ana [1 ]
Gonzalez-Rodriguez, Gil [1 ]
机构
[1] Univ Oviedo, Dept Stat & OR, INDUROT, Mieres 3600, Spain
关键词
Fuzzy methods; Fuzzy data; Fuzziness; Randomness; Statistics; Robust data analysis; Trimming; LINEAR-REGRESSION MODEL; FUZZY RANDOM-VARIABLES; MEANS CLUSTERING MODEL; STATISTICAL-ANALYSIS; ALGORITHMS; VARIANCE; INFORMATION; COMPACT;
D O I
10.1016/j.fss.2015.05.007
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The first aim is to emphasize the use of fuzziness in data analysis to capture information that has been traditionally disregarded with a cost in the precision of the conclusions. Fuzziness can be considered in the data analysis process at various stages, but the main target in this paper will be fuzziness in the data. Depending on the nature of the fuzzy data or the aim to which they are handled, different approaches should be applied. We attempt to contribute to the clarification of such a difference while focusing on the so-called ontic approach in contrast to the epistemic approach. The second aim is to underline the need of considering robust methods to reduce the misleading impact of outliers in fuzzy data analysis. We propose trimming as a general and intuitive method to discard outliers. We exemplify this approach with the case of the ontic fuzzy trimmed mean/variance and highlight the differences with the epistemic case. All the discussions and developments are illustrated by means of a case-study concerning the perception of lengths of men and women. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:260 / 271
页数:12
相关论文
共 57 条
[21]   Maximum likelihood estimation from fuzzy data using the EM algorithm [J].
Denoeux, Thierry .
FUZZY SETS AND SYSTEMS, 2011, 183 (01) :72-91
[22]   FUZZY LEAST-SQUARES [J].
DIAMOND, P .
INFORMATION SCIENCES, 1988, 46 (03) :141-157
[23]   Data analysis with fuzzy clustering methods [J].
Doering, Christian ;
Lesot, Marie-Jeanne ;
Kruse, Rudolf .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 51 (01) :192-214
[24]   On various ways of tackling incomplete information in statistics [J].
Dubois, Didier .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2014, 55 (07) :1570-1574
[25]   Gradualness, uncertainty and bipolarity: Making sense of fuzzy sets [J].
Dubois, Didier ;
Prade, Henri .
FUZZY SETS AND SYSTEMS, 2012, 192 :3-24
[26]   A linear regression model for imprecise response [J].
Ferraro, M. B. ;
Coppi, R. ;
Gonzalez Rodriguez, G. ;
Colubi, A. .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2010, 51 (07) :759-770
[27]   On possibilistic clustering with repulsion constraints for imprecise data [J].
Ferraro, Maria Brigida ;
Giordani, Paolo .
INFORMATION SCIENCES, 2013, 245 :63-75
[28]   A multiple linear regression model for imprecise information [J].
Ferraro, Maria Brigida ;
Giordani, Paolo .
METRIKA, 2012, 75 (08) :1049-1068
[29]   Recurrent neural networks for fuzzy data [J].
Freitag, Steffen ;
Graf, Wolfgang ;
Kaliske, Michael .
INTEGRATED COMPUTER-AIDED ENGINEERING, 2011, 18 (03) :265-280
[30]   Estimating the functional relationships for quality function deployment under uncertainties [J].
Fung, RYK ;
Chen, YZ ;
Tang, JF .
FUZZY SETS AND SYSTEMS, 2006, 157 (01) :98-120