Fuzzy clustering of fuzzy data based on robust loss functions and ordered weighted averaging

被引:31
作者
D'Urso, Pierpaolo [1 ]
Leski, Jacek M. [2 ,3 ]
机构
[1] Sapienza Univ Rome, Dept Social Sci & Econ, Ple Aldo Moro 5, Rome, Italy
[2] Silesian Tech Univ, Inst Elect, Akad 16, PL-44100 Gliwice, Poland
[3] Inst Med Technol & Equipment, Dept Comp Med Syst, Roosevelt St 118, PL-41800 Zabrze, Poland
关键词
Fuzzy data; Robust fuzzy clustering; Fuzzy c-ordered medoids clustering; M-estimators; Ordered weighted averaging; INFORMATIONAL PARADIGM; SIMILARITY MEASURES; SETS; MODELS; ALGORITHMS; DISTANCES; ENTROPY; NUMBERS;
D O I
10.1016/j.fss.2019.03.017
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In many real cases the data are not expressed in term of single values but are imprecise. In all these cases, standard clustering methods for single-valued data are unable to properly take into account the imprecise nature of the data. In this paper, by considering the Partitioning Around Medoids (PAM) approach in a fuzzy framework, we propose a fuzzy clustering method for imprecise data formalized in a fuzzy manner. In particular, in order to neutralize the negative effects of possible outlier fuzzy data in the clustering process, we proposed a robust fuzzy c-medoids clustering method for fuzzy data based on the combination of Huber's M-estimators and Yager's OWA (Ordered Weighted Averaging) operators. The proposed method is able to smooth the influence of anomalous data by means of a suitable parameter, the so-called typicality parameter, capable to tune the influence of the outliers. The performance of the proposed method has been shown by means of a simulation study, composed of experiments on: (i) simple two-dimensional dataset, (ii) benchmark datasets and (iii) the fuzzy-art-outliers dataset. The comparison made with the robust clustering methods known from the literature indicates the competitiveness of the introduced method to others. An application of the suggested method to a real dataset is also provided and the results of the method has been compared with other clustering methods suggested in the literature. In the application, the comparative assessment has shown the informational gain (in term of additional information) of the proposed method vs the other robust methods. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 28
页数:28
相关论文
共 59 条