Disjunctive Rule Lists

被引:1
作者
Ragodos, Ronilo [1 ]
Wang, Tong [1 ]
机构
[1] Univ Iowa, Tippie Coll Business, Iowa City, IA 52242 USA
关键词
interpretable machine learning; decision rules; regression; REGRESSION; CLASSIFICATION; TREES; FRAMEWORK;
D O I
10.1287/ijoc.2022.1242
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this study, we present an interpretable model, disjunctive rule list (DisRL) for regression. This research is motivated by the increasing need for model interpretability, especially in high-stakes decisions such as medicine, where decisions are made on or related to humans. DisRL is a generalized form of rule lists. A DisRL model consists of a list of disjunctive rules embedded in an if-else logic structure that stratifies the data space. Compared with traditional decision trees and other rule list models in the literature that stratify the feature space with single itemsets (an itemset is a conjunction of conditions), each disjunctive rule in DisRL uses a set of itemsets to collectively cover a subregion in the feature space. In addition, a DisRL model is constructed under a global objective that balances the predictive performance and model complexity. To train a DisRL model, we devise a hierarchical stochastic local search algorithm that exploits the properties of DisRL's unique structure to improve search efficiency. The algorithm adopts the main structure of simulated annealing and customizes the proposing strategy for faster convergence. Meanwhile, the algorithm uses a prefix bound to locate a subset of the search area, effectively pruning the search space at each iteration. An ablation study shows the effectiveness of this strategy in pruning the search space. Experiments on public benchmark datasets demonstrate that DisRL outperforms baseline interpretable models, including decision trees and other rule-based regressors.
引用
收藏
页码:3259 / 3276
页数:18
相关论文
共 56 条
  • [1] AbdelWahab M E, 2018, Ann Burns Fire Disasters, V31, P83
  • [2] Bénard C, 2021, PR MACH LEARN RES, V130
  • [3] Benjaafar S, 2020, PREPRINTS, DOI DOI 10.2139/SSRN
  • [4] Predicting Inpatient Flow at a Major Hospital Using Interpretable Analytics
    Bertsimas, Dimitris
    Pauphilet, Jean
    Stevens, Jennifer
    Tandon, Manu
    [J]. M&SOM-MANUFACTURING & SERVICE OPERATIONS MANAGEMENT, 2022, 24 (06) : 2809 - 2824
  • [5] SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation
    Blewitt, Marnie E.
    Gendrel, Anne-Valerie
    Pang, Zhenyi
    Sparrow, Duncan B.
    Whitelaw, Nadia
    Craig, Jeffrey M.
    Apedaile, Anwyn
    Hilton, Douglas J.
    Dunwoodie, Sally L.
    Brockdorff, Neil
    Kay, Graham F.
    Whitelaw, Emma
    [J]. NATURE GENETICS, 2008, 40 (05) : 663 - 669
  • [6] Chibante Rui., 2010, Simulated Annealing: Theory with Applications
  • [7] Simulated annealing: Searching for an optimal temperature schedule
    Cohn, H
    Fielding, M
    [J]. SIAM JOURNAL ON OPTIMIZATION, 1999, 9 (03) : 779 - 802
  • [8] Dash S, 2018, ADV NEUR IN, V31
  • [9] Doshi-Velez F, 2017, Arxiv, DOI [arXiv:1702.08608, DOI 10.48550/ARXIV.1702.08608]
  • [10] USE OF AUTOMATIC INTERACTION DETECTOR AND SIMILAR SEARCH PROCEDURES
    DOYLE, P
    [J]. OPERATIONAL RESEARCH QUARTERLY, 1973, 24 (03) : 465 - 467