Mining Optimal Decision Trees from Itemset Lattices

被引:0
|
作者
Nijssen, Siegfried [1 ]
Fromont, Elisa [1 ]
机构
[1] Katholieke Univ Leuven, Leuven, Belgium
来源
KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING | 2007年
关键词
Decision trees; Frequent itemsets; Formal concepts; Constraint-based mining;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present DL8, an exact algorithm for finding a decision tree that optimizes a ranking function under size, depth, accuracy and leaf constraints. Because the discovery of optimal trees has high theoretical complexity, until now few efforts have been made to compute such trees for real-world datasets. An exact algorithm is of both scientific and practical interest. Front a scientific point of view, it can be used as a gold standard to evaluate the performance of heuristic constraint-based decision tree learners and to gain new insight in traditional decision tree learners. From the application point of view, it can be used to discover trees that cannot be found by heuristic decision tree learners. The key idea behind our algorithm is that there is a relation between constraints on decision trees and constraints on itemsets. We show that optimal decision trees can be extracted from lattices of itemsets in linear time. We give several strategies to efficiently build these lattices. Experiments show that under the same constraints, DL8 obtains better results than C4.5, which confirms that exhaustive search does not always imply overfitting. The results also show that DL8 is a useful and interesting tool to learn decision trees under constraints.
引用
收藏
页码:530 / 539
页数:10
相关论文
共 50 条
  • [1] Optimal constraint-based decision tree induction from itemset lattices
    Nijssen, Siegfried
    Fromont, Elisa
    DATA MINING AND KNOWLEDGE DISCOVERY, 2010, 21 (01) : 9 - 51
  • [2] Optimal constraint-based decision tree induction from itemset lattices
    Siegfried Nijssen
    Elisa Fromont
    Data Mining and Knowledge Discovery, 2010, 21 : 9 - 51
  • [3] The Choice of Optimal Algorithm for Frequent Itemset Mining
    Busarov, Vyacheslav
    Grafeeva, Natalia
    Mikhailova, Elena
    DATABASES AND INFORMATION SYSTEMS IX, 2016, 291 : 211 - 224
  • [4] Optimal Decision Trees Generation from OR-Decision Tables
    Grana, Costantino
    Montangero, Manuela
    Borghesani, Daniele
    Cucchiara, Rita
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2011, PT I, 2011, 6978 : 443 - 452
  • [5] Weighted Support Association Rule Mining using Closed Itemset Lattices in Parallel
    Rahman, A. M. J. Md. Zubair
    Balasubramanie, P.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2009, 9 (03): : 247 - 253
  • [6] Inducing decision trees via concept lattices
    Belohlavek, Radim
    De Baets, Bernard
    Outrata, Jan
    Vychodil, Vilem
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2009, 38 (04) : 455 - 467
  • [7] Optimal multivariate decision trees
    Boutilier, Justin
    Michini, Carla
    Zhou, Zachary
    CONSTRAINTS, 2023, 28 (04) : 549 - 577
  • [8] Optimal multivariate decision trees
    Justin Boutilier
    Carla Michini
    Zachary Zhou
    Constraints, 2023, 28 : 549 - 577
  • [9] A survey of itemset mining
    Fournier-Viger, Philippe
    Lin, Jerry Chun-Wei
    Bay Vo
    Tin Truong Chi
    Zhang, Ji
    Hoai Bac Le
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2017, 7 (04)
  • [10] Assessing Optimal Forests of Decision Trees
    Aglin, Gael
    Nijssen, Siegfried
    Schaus, Pierre
    2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 32 - 39