Collaborative rule generation: An ensemble learning approach

被引:9
作者
Liu, Han [1 ]
Gegov, Alexander [1 ]
Cocea, Mihaela [1 ]
机构
[1] Univ Portsmouth, Sch Comp, Buckingham Bldg,Lion Terrace, Portsmouth PO1 3HE, Hants, England
关键词
Data mining; machine learning; ensemble learning; rule based systems; rule based classification; if-then rules;
D O I
10.3233/IFS-151997
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the vast and rapid increase in data, data mining has become an increasingly important tool for the purpose of knowledge discovery in order to prevent the presence of rich data but poor knowledge. Data mining tasks can be undertaken in two ways, namely, manual walkthrough of data and use of machine learning approaches. Due to the presence of big data, machine learning has thus become a powerful tool to do data mining in intelligent ways. A popular approach of machine learning is inductive learning, which can be used to generate a rule set (a set of rules) using a particular algorithm. Inductive learning can involve a single base algorithm learning from a single data set following a standard learning approach. In this approach, the learning algorithm can generate a single rule set such as decision trees. On the other hand, the inductive learning can also involve a single base algorithm learning from multiple data sets following an ensemble learning approach. In this approach, the learning algorithm can generate multiple rule sets such as random forests. The latter approach is usually designed to reduce overfitting of models that usually arises when the former approach is adopted. In this context, the ensemble learning approach usually enables the improvement of the overall accuracy in prediction. The aim of this paper is to introduce a new approach of ensemble learning called Collaborative Rule Generation. In the new approach, the inductive learning involves multiple base algorithms learning from a single data set to generate a single rule set, which aims to enable each rule to have a higher quality. This paper also includes an experimental study validating the Collaborative Rule Generation approach and discusses the results in both quantitative and qualitative ways.
引用
收藏
页码:2277 / 2287
页数:11
相关论文
共 24 条
  • [1] Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
  • [2] [Anonymous], 2006, Introduction to Data Mining
  • [3] Bache K., 2013, UCI Machine Learning Repository
  • [4] Brain D., 1996, MACHINE LEARNING
  • [5] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [6] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [7] Brin S., 1997, P ACM SIGMOD INT C M
  • [8] PRISM - AN ALGORITHM FOR INDUCING MODULAR RULES
    CENDROWSKA, J
    [J]. INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1987, 27 (04): : 349 - 370
  • [9] Fayyad U, 1996, AI MAG, V17, P37
  • [10] Freund Y., 1996, Machine Learning. Proceedings of the Thirteenth International Conference (ICML '96), P148