Trees with Attention for Set Prediction Tasks

被引:0
作者
Hirsch, Roy [1 ]
Gilad-Bachrach, Ran [2 ,3 ]
机构
[1] Tel Aviv Univ, Dept EE, Tel Aviv, Israel
[2] Tel Aviv Univ, Dept Biomed Engn, Tel Aviv, Israel
[3] Edmond J Safra Ctr Bioinformat, Tel Aviv, Israel
来源
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139 | 2021年 / 139卷
关键词
REGRESSION TREES; NEURAL-NETWORKS; MACHINE; MODELS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In many machine learning applications, each record represents a set of items. For example, when making predictions from medical records, the medications prescribed to a patient are a set whose size is not fixed and whose order is arbitrary. However, most machine learning algorithms are not designed to handle set structures and are limited to processing records of fixed size. Set-Tree, presented in this work, extends the support for sets to tree-based models, such as Random-Forest and Gradient-Boosting, by introducing an attention mechanism and set-compatible split criteria. We evaluate the new method empirically on a wide range of problems ranging from making predictions on sub-atomic particle jets to estimating the redshift of galaxies. The new method outperforms existing tree-based methods consistently and significantly. Moreover, it is competitive and often outperforms Deep Learning. We also discuss the theoretical properties of Set-Trees and explain how they enable item-level explainability.
引用
收藏
页数:12
相关论文
共 66 条
[1]   Playing tag with ANN: boosted top identification with pattern recognition [J].
Almeida, Leandro G. ;
Backovic, Mihailo ;
Cliche, Mathieu ;
Lee, Seung J. ;
Perelstein, Maxim .
JOURNAL OF HIGH ENERGY PHYSICS, 2015, (07)
[2]  
[Anonymous], 2012, ARXIV12020302
[3]  
[Anonymous], 2015, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2015.7298801
[4]  
[Anonymous], 2019, Top quark tagging reference dataset
[5]   The Physics of the B Factories [J].
Bevan, A. J. ;
Golob, B. ;
Mannel, Th. ;
Prell, S. ;
Yabsley, B. D. ;
Abe, K. ;
Aihara, H. ;
Anulli, F. ;
Arnaud, N. ;
Aushev, T. ;
Beneke, M. ;
Beringer, J. ;
Bianchi, F. ;
Bigi, I. I. ;
Bona, M. ;
Brambilla, N. ;
Brodzicka, J. ;
Chang, P. ;
Charles, M. J. ;
Cheng, C. H. ;
Cheng, H. -Y. ;
Chistov, R. ;
Colangelo, P. ;
Coleman, J. P. ;
Drutskoy, A. ;
Druzhinin, V. P. ;
Eidelman, S. ;
Eigen, G. ;
Eisner, A. M. ;
Faccini, R. ;
Flood, K. T. ;
Gambino, P. ;
Gaz, A. ;
Gradl, W. ;
Hayashii, H. ;
Higuchi, T. ;
Hulsbergen, W. D. ;
Hurth, T. ;
Iijima, T. ;
Itoh, R. ;
Jackson, P. D. ;
Kass, R. ;
Kolomensky, Yu. G. ;
Kou, E. ;
Krizan, P. ;
Kronfeld, A. ;
Kumano, S. ;
Kwon, Y. J. ;
Latham, T. E. ;
Leith, D. W. G. S. .
EUROPEAN PHYSICAL JOURNAL C, 2014, 74 (11) :I-898
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]  
Breiman L., 2017, Classification and Regression Trees, DOI 10.1201/9781315139470
[8]  
Cattral R., 2002, Recent Advances in Computers, Computing and Communications, V1, P296
[9]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[10]  
Cogan J., 2015, J HIGH ENERGY PHYS