Deep and interpretable regression models for ordinal outcomes

被引:34
作者
Kook, Lucas [1 ,2 ]
Herzog, Lisa [1 ,2 ]
Hothorn, Torsten [1 ]
Durr, Oliver [3 ]
Sick, Beate [1 ,2 ]
机构
[1] Univ Zurich, Epidemiol Biostat & Prevent Inst, Hirschengraben 84, CH-8001 Zurich, Switzerland
[2] Zurich Univ Appl Sci, Inst Data Anal & Proc Design, Rosenstr 3, CH-8400 Winterthur, Switzerland
[3] Konstanz Univ Appl Sci, Inst Opt Syst, Alfred Wachtel Str 8, D-78462 Constance, Germany
基金
瑞士国家科学基金会;
关键词
Deep learning; Interpretability; Distributional regression; Ordinal regression; Transformation models;
D O I
10.1016/j.patcog.2021.108263
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Outcomes with a natural order commonly occur in prediction problems and often the available input data are a mixture of complex data like images and tabular predictors. Deep Learning (DL) models are state-of-the-art for image classification tasks but frequently treat ordinal outcomes as unordered and lack interpretability. In contrast, classical ordinal regression models consider the outcome's order and yield interpretable predictor effects but are limited to tabular data. We present ordinal neural network transformation models (omkams), which unite DL with classical ordinal regression approaches. ONTRAM5 are a special case of transformation models and trade off flexibility and interpretability by additively decomposing the transformation function into terms for image and tabular data using jointly trained neural networks. The performance of the most flexible ONTRAM is by definition equivalent to a standard multiclass DL model trained with cross-entropy while being faster in training when facing ordinal outcomes. Lastly, we discuss how to interpret model components for both tabular and image data on two publicly available datasets. (C) 2021 Published by Elsevier Ltd.
引用
收藏
页数:11
相关论文
共 35 条
[1]  
Abadi M, 2015, TensorFlow: LargeScale Machine Learning on Heterogeneous Systems
[2]  
Ali A, 2020, PR MACH LEARN RES, V119
[3]  
Amorim J.P., 2018, P EUR S ART NEUR NET
[4]  
Cao W., 2019, ARXIV190107884
[5]  
Cardoso JS, 2007, J MACH LEARN RES, V8, P1393
[6]   A Neural Network Approach to Ordinal Regression [J].
Cheng, Jianlin ;
Wang, Zheng ;
Pollastri, Gianluca .
2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, :1279-1284
[7]  
Chollet F., 2015, Keras
[8]  
Chu W, 2005, J MACH LEARN RES, V6, P1019
[9]   Support vector ordinal regression [J].
Chu, Wei ;
Keerthi, S. Sathiya .
NEURAL COMPUTATION, 2007, 19 (03) :792-815
[10]   Modeling wine preferences by data mining from physicochemical properties [J].
Cortez, Paulo ;
Cerdeira, Antonio ;
Almeida, Fernando ;
Matos, Telmo ;
Reis, Jose .
DECISION SUPPORT SYSTEMS, 2009, 47 (04) :547-553