A survey of distributed classification based ensemble data mining methods

Cited by: 4
Authors
Mokeddem, D. [1]
Belbachir, H. [1]
Affiliation
[1] Laboratory of Signal, Systems and Databases LSSD, Department of Computer Sciences, University of Sciences and Technology Mohamed Boudiaf, El Mnaouer, Oran
Keywords
Decision trees algorithm; Distributed data mining; Ensemble learning methods
DOI
10.3923/jas.2009.3739.3745
Abstract
Distributed classification is a distributed data mining task that predicts whether a data instance belongs to a predefined class. It serves two different objectives: the first is to scale algorithms up to large data sets, distributing the data in order to increase overall efficiency; the second is to process data that are inherently distributed and autonomous. Ensemble learning methods, which are very promising in terms of accuracy and also lend themselves to distribution, can be adapted to distributed data mining. This study surveys approaches that use ensemble learning methods for distributed classification, with decision tree algorithms as the base classifier. In line with the two objectives above, most work reported in the literature addresses the problem using one of two techniques: the adaptation of ensemble learning methods to disjoint data sets, in the context of mining inherently distributed data, or the parallelization of ensemble learning methods, in a scalability context. This survey suggests that the work done from either perspective (scaling up data mining algorithms or mining inherently distributed data) could be complementary. Some open questions, current debates and future directions are also discussed. © 2009 Asian Network for Scientific Information.
Pages: 3739-3745
Number of pages: 6