A parallel method for computing rough set approximations

被引:91
作者
Zhang, Junbo [1 ]
Li, Tianrui [1 ]
Ruan, Da [2 ,3 ]
Gao, Zizhe [1 ]
Zhao, Chengbing [1 ]
机构
[1] SW Jiaotong Univ, Sch Informat Sci & Technol, Chengdu 610031, Peoples R China
[2] Belgian Nucl Res Ctr SCK CEN, B-2400 Mol, Belgium
[3] Univ Ghent, Dept Appl Math & Comp Sci, B-9000 Ghent, Belgium
基金
美国国家科学基金会;
关键词
Rough sets; Data mining; Approximations; Hadoop; MapReduce; ATTRIBUTE REDUCTION; MAPREDUCE; ALGORITHM; MODEL;
D O I
10.1016/j.ins.2011.12.036
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Massive data mining and knowledge discovery present a tremendous challenge with the data volume growing at an unprecedented rate. Rough set theory has been successfully applied in data mining. The lower and upper approximations are basic concepts in rough set theory. The effective computation of approximations is vital for improving the performance of data mining or other related tasks. The recently introduced MapReduce technique has gained a lot of attention from the scientific community for its applicability in massive data analysis. This paper proposes a parallel method for computing rough set approximations. Consequently, algorithms corresponding to the parallel method based on the MapReduce technique are put forward to deal with the massive data. An extensive experimental evaluation on different large data sets shows that the proposed parallel method is effective for data mining. (C) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:209 / 223
页数:15
相关论文
共 38 条
[1]  
[Anonymous], 1998, UCI REPOSITORY MACHI
[2]  
[Anonymous], 1991, ROUGH SETS THEORETIC
[3]  
[Anonymous], 2009, Hadoop: The Definitive Guide
[4]  
[Anonymous], 2011, Pei. data mining concepts and techniques
[5]   Scheduling divisible MapReduce computations [J].
Berlinska, J. ;
Drozdowski, M. .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2011, 71 (03) :450-459
[6]   Sequential covering rule induction algorithm for variable consistency rough set approaches [J].
Blaszczynski, Jerzy ;
Slowinski, Roman ;
Szelag, Marcin .
INFORMATION SCIENCES, 2011, 181 (05) :987-1002
[7]   A Rough Set Based Dynamic Maintenance Approach for Approximations in Coarsening and Refining Attribute Values [J].
Chen, Hongmei ;
Li, Tianrui ;
Qiao, Shaojie ;
Ruan, Da .
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2010, 25 (10) :1005-1026
[8]   A hybrid model based on rough sets theory and genetic algorithms for stock price forecasting [J].
Cheng, Ching-Hsue ;
Chen, Tai-Liang ;
Wei, Liang-Ying .
INFORMATION SCIENCES, 2010, 180 (09) :1610-1629
[9]   Positive approximation and converse approximation in interval-valued fuzzy rough sets [J].
Cheng, Yi ;
Miao, Duoqian ;
Feng, Qinrong .
INFORMATION SCIENCES, 2011, 181 (11) :2086-2110
[10]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137