A Novel Approach for Dealing with Missing Values in Machine Learning Datasets with Discrete Values

被引:4
作者
Abu-Soud, Saleh M. [1 ]
机构
[1] Princess Sumaya Univ Technol, Dept Software Engn, Amman, Jordan
来源
2019 INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCES (ICCIS) | 2019年
关键词
ILA; inductive learning; missing values; noisy data; imputation;
D O I
10.1109/iccisci.2019.8716430
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One of the problems that faces machine learning researchers is the incomplete datasets with missing values, knowing that most machine learning algorithms deal with complete datasets. ILA is one of these algorithms which deal only with datasets with complete discrete values. In this paper, a novel approach for dealing with missing values has been developed and tailored with ILA where the treatment of missing values is performed during the induction process. The proposed system is called ILA4. ILA4 has been tested on several datasets with different percentages of missing values. Its results also compared with some common methods for treating missing values. The results show that most of the time, the results ILA4 appear to be comparable to the best cases of some other well-known methods for dealing with missing values problem, namely; the most common value, the most common value restricted to a concept, and delete strategy.
引用
收藏
页码:118 / 122
页数:5
相关论文
共 14 条
[1]  
Abdelrazaq Duaa, 2018, INT ARAB J INFORM TE, V15
[2]  
Abu-Soud S, 2009, WSEAS T COMPUTERS, V8
[3]  
Abu-Soud S., 1997, P 10 INT C IND ENG A
[4]  
Abu-Soud S, 1999, J I MATH COMPUTER SC, V10, P201
[5]  
Abu-Soud S, 2000, AMSE J FRANCE DEC
[6]  
Abu-Soud S., 2016, WSEAS T INFORM SCI A, V13
[7]  
Abu-Soud Saleh M., 2018, International Journal of Circuits, Systems and Signal Processing, V12, P661
[8]  
Abu-Soud Saleh M., 2018, WSEAS Transactions on Systems and Control, V13, P171
[9]   ILATalk: a new multilingual text-to-speech synthesizer with machine learning [J].
Abu-Soud, Saleh M. .
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2016, 19 (01) :55-64
[10]  
Angelov Boyan, 2017, WORKING MISSING DATA