A lazy approach to associative classification

被引:53
|
作者
Baralis, Elena [1 ]
Chiusano, Silvia [1 ]
Garza, Paolo [1 ]
机构
[1] Politecn Torino, Dipartimento Automat & Informat, I-10129 Turin, Italy
关键词
data mining; associative classification; association rules; condensed representations;
D O I
10.1109/TKDE.2007.190677
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Associative classification is a promising technique to build accurate classifiers. However, in large or correlated data sets, association rule mining may yield huge rule sets. Hence, several pruning techniques have been proposed to select a small subset of high-quality rules. Since the availability of a "rich" rule set may improve the accuracy of the classifier, we argue that rule pruning should be reduced to a minimum. The L-3 associative classifier is built by means of a lazy pruning technique that discards exclusively rules that only misclassify training data. The classification of unlabeled data is performed in two steps. A small subset of high-quality rules is first considered. When this set is not able to classify the data, a larger rule set is exploited. This second set includes rules usually discarded by previous approaches. To cope with the need of mining large rule sets and to efficiently use them for classification, a compact form is proposed to represent a complete rule set in a space-efficient way and without information loss. An extensive experimental evaluation on real and synthetic data sets shows that L-3 improves the classification accuracy with respect to previous approaches.
引用
收藏
页码:156 / 171
页数:16
相关论文
共 50 条
  • [41] Associative classification model for forecasting stock market trends
    Tetila E.C.
    MacHado B.B.
    Rorigues J.F.
    Zanoni D.A.
    Belete N.A.D.S.
    Zardo T.
    Constantino M.
    Pistori H.
    International Journal of Business Intelligence and Data Mining, 2021, 19 (01) : 97 - 112
  • [42] A Cost-Sensitive Based Approach for Improving Associative Classification on Imbalanced Datasets
    Waiyamai, Kitsana
    Suwannarattaphoom, Phoonperm
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, MLDM 2014, 2014, 8556 : 31 - 42
  • [43] Associative Classification in Big Data through a G3P Approach
    Luna, J. M.
    Padillo, F.
    Ventura, S.
    PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS, BIG DATA AND SECURITY (IOTBDS 2019), 2019, : 94 - 102
  • [44] Classification with Evidential Associative Rules
    Samet, Ahmed
    Lefevre, Eric
    Ben Yahia, Sadok
    INFORMATION PROCESSING AND MANAGEMENT OF UNCERTAINTY IN KNOWLEDGE-BASED SYSTEMS, PT I, 2014, 442 : 25 - 35
  • [45] LAC: Library for associative classification
    Padillo, Francisco
    Maria Luna, Jose
    Ventura, Sebastian
    KNOWLEDGE-BASED SYSTEMS, 2020, 193
  • [46] Multiple labels associative classification
    Fadi Abdeljaber Thabtah
    Peter Cowling
    Yonghong Peng
    Knowledge and Information Systems, 2006, 9 : 109 - 129
  • [47] DAC: Discriminative Associative Classification
    Seyfi M.
    Xu Y.
    Nayak R.
    SN Computer Science, 4 (4)
  • [48] Multiple labels associative classification
    Thabtah, FA
    Cowling, P
    Peng, YH
    KNOWLEDGE AND INFORMATION SYSTEMS, 2006, 9 (01) : 109 - 129
  • [49] Parallel Associative Classification Data Mining Frameworks Based MapReduce
    Thabtah, Fadi
    Hammoud, Suhel
    Abdel-Jaber, Hussein
    PARALLEL PROCESSING LETTERS, 2015, 25 (02)
  • [50] A new fast associative classification algorithm for detecting phishing websites
    Hadi, Wa'el
    Aburub, Faisal
    Alhawari, Samer
    APPLIED SOFT COMPUTING, 2016, 48 : 729 - 734