Missing data imputation using fuzzy-rough methods

Cited by: 104
Authors
Amiri, Mehran [1]
Jensen, Richard [2]
Affiliations
[1] Islamic Azad Univ, Sci & Res Branch, Dept Comp Engn, Kerman, Iran
[2] Aberystwyth Univ, Dept Comp Sci, Ceredigion SY23 3DB, Wales
Keywords
Missing value imputation; Fuzzy-rough sets; Vaguely quantified rough sets; Ordered weighted average-based rough sets; VALUES;
DOI
10.1016/j.neucom.2016.04.015
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Missing values exist in many generated datasets in science. Therefore, utilizing missing data imputation methods is a common and important practice. These methods are a kind of treatment for uncertainty and vagueness existing in datasets. On the other hand, methods based on fuzzy-rough sets provide excellent tools for dealing with uncertainty, possessing highly desirable properties such as robustness and noise tolerance. Furthermore, they can find minimal representations of data and do not need potentially erroneous user inputs. As a result, utilizing fuzzy-rough sets for imputation should be an effective approach. In this paper, we propose three missing value imputation methods based on fuzzy-rough sets and their recent extensions, namely implicator/t-norm based fuzzy-rough sets, vaguely quantified rough sets, and ordered weighted average based rough sets. These methods are compared against 11 state-of-the-art imputation methods implemented in the KEEL data mining software on 27 benchmark datasets. The results show, via non-parametric statistical analysis, that the proposed methods exhibit excellent performance in general. (C) 2016 Elsevier B.V. All rights reserved.
Pages: 152-164
Number of pages: 13