Safe RuleFit: Learning Optimal Sparse Rule Model by Meta Safe Screening

被引：4

作者：

Kato, Hiroki ^{[1
]}

Hanada, Hiroyuki ^{[2
]}

Takeuchi, Ichiro ^{[2
,3
]}

机构：

[1] Nagoya Inst Technol, Dept Comp Sci, Nagoya, Aichi 4668555, Japan

[2] RIKEN, Ctr Adv Intelligence Project, Chuo, Tokyo 1030027, Japan

[3] Nagoya Univ, Grad Sch Engn, Nagoya, Aichi 4648601, Japan

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2023年 / 45卷 / 02期

关键词：

Predictive models; Random forests; Dictionaries; Analytical models; Regression tree analysis; Pattern analysis; Numerical models; Machine learning; knowledge representation formalisms and methods; convex programming; combinatorial algorithms; REGRESSION;

D O I：

10.1109/TPAMI.2022.3167993

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We consider the problem of learning a sparse rule model, a prediction model in the form of a sparse linear combination of rules, where a rule is an indicator function defined over a hyper-rectangle in the input space. Since the number of all possible such rules is extremely large, it has been computationally intractable to select the optimal set of active rules. In this paper, to solve this difficulty for learning the optimal sparse rule model, we propose Safe RuleFit (SRF). Our basic idea is to develop meta safe screening (mSS), which is a non-trivial extension of well-known safe screening (SS) techniques. While SS is used for screening out one feature, mSS can be used for screening out multiple features by exploiting the inclusion-relations of hyper-rectangles in the input space. SRF provides a general framework for fitting sparse rule models for regression and classification, and it can be extended to handle more general sparse regularizations such as group regularization. We demonstrate the advantages of SRF through intensive numerical experiments.

引用

页码：2330 / 2343

页数：14

共 35 条

[21] INTERPRETABLE CLASSIFIERS USING RULES AND BAYESIAN ANALYSIS: BUILDING A BETTER STROKE PREDICTION MODEL
Letham, Benjamin
Rudin, Cynthia
McCormick, Tyler H.
Madigan, David
[J]. ANNALS OF APPLIED STATISTICS, 2015, 9 (03) : 1350 - 1371
[22] Safe Pattern Pruning: An Efficient Approach for Predictive Pattern Mining
Nakagawa, Kazuya
Suzumura, Shinya
Karasuyama, Masayuki
Tsuda, Koji
Takeuchi, Ichiro
[J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 1785 - 1794
[23] Ndiaye E., 2016, P NIPS, P388
[24] Ndiaye E, 2015, ADV NEUR IN, V28
[25] Efficient mining of association rules using closed itemset lattices
Pasquier, N
Bastide, Y
Taouil, R
Lakhal, L
[J]. INFORMATION SYSTEMS, 1999, 24 (01) : 25 - 46
[26] Pedregosa F, 2011, J MACH LEARN RES, V12, P2825
[27] Rockafellar, 1970, CONVEX ANAL
[28] A Sparse-Group Lasso
Simon, Noah
Friedman, Jerome
Hastie, Trevor
Tibshirani, Robert
[J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2013, 22 (02) : 231 - 245
[29] Regression shrinkage and selection via the Lasso
Tibshirani, R
[J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1996, 58 (01) : 267 - 288
[30] A coordinate gradient descent method for nonsmooth separable minimization
Tseng, Paul
Yun, Sangwoon
[J]. MATHEMATICAL PROGRAMMING, 2009, 117 (1-2) : 387 - 423

← 1 2 3 4 →