Calibrated lazy associative classification

被引:21
|
作者
Veloso, Adriano [1 ]
Meira, Wagner, Jr. [1 ]
Goncalves, Marcos [1 ]
Almeida, Humberto M. [1 ]
Zaki, Mohammed [2 ]
机构
[1] Univ Fed Minas Gerais, Dept Comp Sci, Belo Horizonte, MG, Brazil
[2] Rensselaer Polytech Inst, Dept Comp Sci, Troy, NY 12181 USA
关键词
Classification; MDL; Calibration;
D O I
10.1016/j.ins.2010.03.007
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Classification is a popular machine learning task. Given an example x and a class c, a classifier usually works by estimating the probability of x being member of c (i.e., membership probability). Well calibrated classifiers are those able to provide accurate estimates of class membership probabilities, that is, the estimated probability (p) over cap (c vertical bar x) is close to p(c vertical bar(p) over cap (c vertical bar x)), which is the true, (unknown) empirical probability of x being member of c given that the probability estimated by the classifier is (p) over cap (c vertical bar x). Calibration is not a necessary property for producing accurate classifiers, and, thus, most of the research has focused on direct accuracy maximization strategies rather than on calibration. However, non-calibrated classifiers are problematic in applications where the reliability associated with a prediction must be taken into account. In these applications, a sensible use of the classifier must be based on the reliability of its predictions, and, thus, the classifier must be well calibrated. In this paper we show that lazy associative classifiers (LAC) are well calibrated using an MM.:based entropy minimization method. We investigate important applications where such characteristics (i.e., accuracy and calibration) are relevant, and we demonstrate empirically that LAC outperforms other classifiers, such as SVMs, Naive Bayes, and Decision Trees (even after these classifiers are calibrated). Additional highlights of LAC include the ability to incorporate reliable predictions for improving training, and the ability to refrain from doubtful predictions. (C) 2010 Elsevier Inc. All rights reserved.
引用
收藏
页码:2656 / 2670
页数:15
相关论文
共 50 条
  • [31] LAC: Library for associative classification
    Padillo, Francisco
    Maria Luna, Jose
    Ventura, Sebastian
    KNOWLEDGE-BASED SYSTEMS, 2020, 193
  • [32] Associative classification in text categorization
    Chen, J
    Yin, J
    Zhang, J
    Huang, J
    ADVANCES IN INTELLIGENT COMPUTING, PT 1, PROCEEDINGS, 2005, 3644 : 1035 - 1044
  • [33] Multiple labels associative classification
    Fadi Abdeljaber Thabtah
    Peter Cowling
    Yonghong Peng
    Knowledge and Information Systems, 2006, 9 : 109 - 129
  • [34] A review of associative classification mining
    Thabtah, Fadi
    KNOWLEDGE ENGINEERING REVIEW, 2007, 22 (01): : 37 - 65
  • [35] Associative classification with prediction confidence
    Do, Tien Dung
    Hui, Siu Cheung
    Fong, Alvis C. M.
    ADVANCES IN MACHINE LEARNING AND CYBERNETICS, 2006, 3930 : 199 - 208
  • [36] Classification of the associative number system
    Schouten, JA
    MATHEMATISCHE ANNALEN, 1915, 76 : 1 - 66
  • [37] Multiple labels associative classification
    Thabtah, FA
    Cowling, P
    Peng, YH
    KNOWLEDGE AND INFORMATION SYSTEMS, 2006, 9 (01) : 109 - 129
  • [38] An approach for adaptive associative classification
    Wang, Xiaofeng
    Yue, Kun
    Niu, WenJia
    Shi, Zhongzhi
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (09) : 11873 - 11883
  • [39] Lazy, Lazy, Lazy
    Mahler, Nicolas
    Kilic, Ilse
    Widhalm, Franz
    Strobl, Edda
    Kaplan, Helmut
    Hofer, Regina
    Maurer, Leopold
    Konrad, Fatzinek Michaela
    Falkner, Brigitta
    Wolf, Heinz
    Lust, Ulli
    Suess, Franz
    LITERATUR UND KRITIK, 2018, (525): : 31 - +
  • [40] Lazy attribute selection: Choosing attributes at classification time
    Pereira, Rafael B.
    Plastino, Alexandre
    Zadrozny, Bianca
    Merschmann, Luiz Henrique de C.
    Freitas, Alex A.
    INTELLIGENT DATA ANALYSIS, 2011, 15 (05) : 715 - 732