An RST based Efficient Preprocessing Technique for Handling Inconsistent Data

被引:0
作者
Samsani, Surekha [1 ]
机构
[1] JNTUK Univ Coll Engn Vizianagaram, Dept Comp Sci & Engn, Vizianagaram, India
来源
2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH | 2016年
关键词
Data Preprocessing; Knowledge Discovery; Rough Set Theory; Inconsistent Data;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data Preprocessing is an essential and primary step in the process of knowledge discovery; because the data obtained from the logs may be incomplete, noisy or inconsistent. The quality of the training data plays a vital role in the success of the data mining algorithms thus; Data Preprocessing should not be an exception in the process of knowledge discovery. The most promising attributes of the quality data includes completeness, consistency and timeliness. Mainly the existence of Inconsistent data misguides the mining algorithm and indirectly affects the performance of the data mining algorithm. This paper gives, Rough Set Theory based approach for identifying and dealing with such inconsistencies in the given dataset and its performance is tested by submitting the preprocessed data to the tree based classifier. Finally, the experimental results revealed the importance of data preprocessing.
引用
收藏
页码:298 / 305
页数:8
相关论文
共 9 条
[1]  
[Anonymous], 2007, INT J COMPUTER INFOR
[2]  
Fayyad U, 1996, AI MAG, V17, P37
[3]  
Hall M., 2009, SIGKDD EXPLORATIONS, V11, P10, DOI [DOI 10.1145/1656274.1656278, 10.1145/1656274.1656278]
[4]  
Mitchell T. M., 1997, MACH LEARN, pXVII
[5]  
Pattaraintakorn P, APPL MATH LETT, P400
[6]   ROUGH SETS [J].
PAWLAK, Z .
INTERNATIONAL JOURNAL OF COMPUTER & INFORMATION SCIENCES, 1982, 11 (05) :341-356
[7]   Rough set approach to knowledge-based decision support [J].
Pawlak, Z .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1997, 99 (01) :48-57
[8]  
Rockah Lior, MACHINE PERCEPTION A, V69
[9]  
Surekha S., 2015, P INT C INT SYST CON, P20