Highly adaptive regression trees

被引:0
作者
Nizam, Sohail [1 ]
Benkeser, David [1 ]
机构
[1] Emory Univ, Rollins Sch Publ Hlth, Dept Biostat & Bioinformat, 1518 Clifton Rd, Atlanta, GA 30322 USA
基金
美国国家科学基金会;
关键词
Machine learning; Interpretability; Decision trees; Recursive partitioning; Highly adaptive lasso; MACHINE LEARNING-METHODS; DECISION TREES;
D O I
10.1007/s12065-023-00836-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The development of machine learning methods that are both accurate and interpretable is of paramount importance in healthcare and many other fields. The Highly Adaptive Lasso (HAL) has been shown to have predictive performance on par with state-of-the art algorithms. HAL involves performing regularized regression of the outcome on a tensor product of indicator basis functions. In this paper we show that this basis can be represented as a non-recursive partitioning of the feature space and propose a method for mapping this partitioning implied by HAL to a recursive partitioning. Such a mapping then allows for the representation of HAL as a decision tree, thereby providing interpretability of predictions made by the algorithm. We refer to this post-hoc method for interpretability as Highly Adaptive Regression Trees (HART). We provide a set of algorithms to construct this mapping and conveniently visualize the resulting tree. Using real data, we show that HAL's predictive performance is on par with state-of-the-art methods, and we demonstrate the construction and interpretation of HARTs.
引用
收藏
页码:535 / 547
页数:13
相关论文
共 28 条
[1]   Use of a machine learning framework to predict substance use disorder treatment success [J].
Acion, Laura ;
Kelmansky, Diana ;
van der Laan, Mark ;
Sahker, Ethan ;
Jones, DeShauna ;
Arndt, Stephan .
PLOS ONE, 2017, 12 (04)
[2]  
Aeberhard S., 1992, Tech. Rep, V92, DOI [DOI 10.1016/0031-3203(94)90145-7, 10.1016/0031-3203(94) 90145-7]
[3]   Fundamentals and exchange rate forecastability with simple machine learning methods [J].
Amat, Christophe ;
Michalski, Tomasz ;
Stoltz, Gilles .
JOURNAL OF INTERNATIONAL MONEY AND FINANCE, 2018, 88 :1-24
[4]  
[Anonymous], 1980, Appl Stat, DOI DOI 10.2307/2986296
[5]  
Ayres-de Campos D, 2000, J Matern Fetal Med, V9, P311
[6]   Machine Learning Methods for Demand Estimation [J].
Bajari, Patrick ;
Nekipelov, Denis ;
Ryan, Stephen P. ;
Yang, Miaoyu .
AMERICAN ECONOMIC REVIEW, 2015, 105 (05) :481-485
[7]   The Highly Adaptive Lasso Estimator [J].
Benkeser, David ;
van der Laan, Mark .
PROCEEDINGS OF 3RD IEEE/ACM INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS, (DSAA 2016), 2016, :689-696
[8]   Improving the variable ordering of OBDDs is NP-complete [J].
Bollig, B ;
Wegener, I .
IEEE TRANSACTIONS ON COMPUTERS, 1996, 45 (09) :993-1002
[9]  
Breiman L., 1984, Classification and regression trees, DOI DOI 10.1201/9781315139470
[10]   Decision tree-based classifier in providing telehealth service [J].
Chern, Ching-Chin ;
Chen, Yu-Jen ;
Hsiao, Bo .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (1)