Trees with Attention for Set Prediction Tasks

被引:0
作者
Hirsch, Roy [1 ]
Gilad-Bachrach, Ran [2 ,3 ]
机构
[1] Tel Aviv Univ, Dept EE, Tel Aviv, Israel
[2] Tel Aviv Univ, Dept Biomed Engn, Tel Aviv, Israel
[3] Edmond J Safra Ctr Bioinformat, Tel Aviv, Israel
来源
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139 | 2021年 / 139卷
关键词
REGRESSION TREES; NEURAL-NETWORKS; MACHINE; MODELS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In many machine learning applications, each record represents a set of items. For example, when making predictions from medical records, the medications prescribed to a patient are a set whose size is not fixed and whose order is arbitrary. However, most machine learning algorithms are not designed to handle set structures and are limited to processing records of fixed size. Set-Tree, presented in this work, extends the support for sets to tree-based models, such as Random-Forest and Gradient-Boosting, by introducing an attention mechanism and set-compatible split criteria. We evaluate the new method empirically on a wide range of problems ranging from making predictions on sub-atomic particle jets to estimating the redshift of galaxies. The new method outperforms existing tree-based methods consistently and significantly. Moreover, it is competitive and often outperforms Deep Learning. We also discuss the theoretical properties of Set-Trees and explain how they enable item-level explainability.
引用
收藏
页数:12
相关论文
共 66 条
[21]   Jet flavor classification in high-energy physics with deep neural networks [J].
Guest, Daniel ;
Collado, Julian ;
Baldi, Pierre ;
Hsu, Shih-Chieh ;
Urban, Gregor ;
Whiteson, Daniel .
PHYSICAL REVIEW D, 2016, 94 (11)
[22]  
Guillame-Bert M., 2020, ARXIV200909991
[23]  
Guillame-Bert M, 2017, J MACH LEARN RES, V18
[24]   Comparison of random forest, artificial neural networks and support vector machine for intelligent diagnosis of rotating machinery [J].
Han, Te ;
Jiang, Dongxiang ;
Zhao, Qi ;
Wang, Lei ;
Yin, Kai .
TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2018, 40 (08) :2681-2693
[25]  
Hastie T., 2009, International Statistical Review, DOI [DOI 10.1007/978-0-387-84858-7, DOI 10.1111/J.1751-5823.2009.00095_18.X3]
[26]  
Ho TK, 1998, IEEE T PATTERN ANAL, V20, P832, DOI 10.1109/34.709601
[27]  
Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.8.1735, 10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[28]  
Huang Zhiheng, 2015, Computer Science
[29]   MIMIC-III, a freely accessible critical care database [J].
Johnson, Alistair E. W. ;
Pollard, Tom J. ;
Shen, Lu ;
Lehman, Li-wei H. ;
Feng, Mengling ;
Ghassemi, Mohammad ;
Moody, Benjamin ;
Szolovits, Peter ;
Celi, Leo Anthony ;
Mark, Roger G. .
SCIENTIFIC DATA, 2016, 3
[30]  
Kaggle, 2020, KAGGLE STATE MACHINE