Calibrated lazy associative classification

Cited by: 21

Authors:

Veloso, Adriano [1]

Meira, Wagner, Jr. [1]

Goncalves, Marcos [1]

Almeida, Humberto M. [1]

Zaki, Mohammed [2]

Affiliations:

[1] Univ Fed Minas Gerais, Dept Comp Sci, Belo Horizonte, MG, Brazil

[2] Rensselaer Polytech Inst, Dept Comp Sci, Troy, NY 12181 USA

Source:

INFORMATION SCIENCES | 2011 / Vol. 181 / No. 13

Keywords:

Classification; MDL; Calibration;

DOI:

10.1016/j.ins.2010.03.007

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Classification is a popular machine learning task. Given an example x and a class c, a classifier usually works by estimating the probability of x being a member of c (i.e., the membership probability). Well-calibrated classifiers are those able to provide accurate estimates of class membership probabilities; that is, the estimated probability $\hat{p}(c \mid x)$ is close to $p(c \mid \hat{p}(c \mid x))$, which is the true (unknown) empirical probability of x being a member of c given that the probability estimated by the classifier is $\hat{p}(c \mid x)$. Calibration is not a necessary property for producing accurate classifiers, and thus most research has focused on direct accuracy maximization strategies rather than on calibration. However, non-calibrated classifiers are problematic in applications where the reliability associated with a prediction must be taken into account. In these applications, a sensible use of the classifier must be based on the reliability of its predictions, and thus the classifier must be well calibrated. In this paper we show that lazy associative classifiers (LAC) are well calibrated using an MDL-based entropy minimization method. We investigate important applications where such characteristics (i.e., accuracy and calibration) are relevant, and we demonstrate empirically that LAC outperforms other classifiers, such as SVMs, Naive Bayes, and Decision Trees (even after these classifiers are calibrated). Additional highlights of LAC include the ability to incorporate reliable predictions for improving training, and the ability to refrain from doubtful predictions. © 2010 Elsevier Inc. All rights reserved.
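
The calibration condition in the abstract (the estimated probability should be close to the empirical frequency observed among examples that received that estimate) can be illustrated with a small sketch. This is not the paper's MDL-based entropy minimization method: it swaps in plain equal-width binning, and the names `calibration_table` and `expected_calibration_error` as well as the `n_bins` parameter are hypothetical, chosen only for illustration.

```python
def calibration_table(probs, labels, n_bins=5):
    """Group predictions into equal-width bins (an assumption, not the
    paper's MDL-based binning) and return one tuple per non-empty bin:
    (mean estimated probability, empirical frequency of the class, count)."""
    bins = [[] for _ in range(n_bins)]
    for p, y in zip(probs, labels):
        # Clamp so that p == 1.0 falls into the last bin.
        idx = min(int(p * n_bins), n_bins - 1)
        bins[idx].append((p, y))
    table = []
    for b in bins:
        if b:
            mean_p = sum(p for p, _ in b) / len(b)
            freq = sum(y for _, y in b) / len(b)
            table.append((mean_p, freq, len(b)))
    return table


def expected_calibration_error(probs, labels, n_bins=5):
    """Weighted average gap between estimated and empirical probability."""
    n = len(probs)
    return sum(
        count / n * abs(mean_p - freq)
        for mean_p, freq, count in calibration_table(probs, labels, n_bins)
    )


if __name__ == "__main__":
    # Estimated membership probabilities and true 0/1 class memberships.
    probs = [0.1, 0.1, 0.9, 0.9]
    labels = [0, 0, 1, 1]
    for mean_p, freq, count in calibration_table(probs, labels):
        print(f"estimate={mean_p:.2f} empirical={freq:.2f} n={count}")
    print(f"calibration error={expected_calibration_error(probs, labels):.3f}")
```

Under this simplified measure, a well-calibrated classifier yields bins whose mean estimate and empirical frequency nearly coincide, so the error is close to zero.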

Pages: 2656-2670

Number of pages: 15
