DeepTLF: robust deep neural networks for heterogeneous tabular data

被引:7
作者
Borisov, Vadim [1 ]
Broelemann, Klaus [2 ]
Kasneci, Enkelejda [1 ]
Kasneci, Gjergji [1 ,2 ]
机构
[1] Univ Tubingen, Tubingen, Germany
[2] SCHUFA Holding AG, Wiesbaden, Germany
关键词
Deep neural networks; Heterogeneous data; Tabular data; Tabular data encoding; Multimodal learning;
D O I
10.1007/s41060-022-00350-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although deep neural networks (DNNs) constitute the state of the art in many tasks based on visual, audio, or text data, their performance on heterogeneous, tabular data is typically inferior to that of decision tree ensembles. To bridge the gap between the difficulty of DNNs to handle tabular data and leverage the flexibility of deep learning under input heterogeneity, we propose DeepTLF, a framework for deep tabular learning. The core idea of our method is to transform the heterogeneous input data into homogeneous data to boost the performance of DNNs considerably. For the transformation step, we develop a novel knowledge distillations approach, TreeDrivenEncoder, which exploits the structure of decision trees trained on the available heterogeneous data to map the original input vectors onto homogeneous vectors that a DNN can use to improve the predictive performance. Within the proposed framework, we also address the issue of the multimodal learning, since it is challenging to apply decision tree ensemble methods when other data modalities are present. Through extensive and challenging experiments on various real-world datasets, we demonstrate that the DeepTLF pipeline leads to higher predictive performance. On average, our framework shows 19.6% performance improvement in comparison to DNNs. The DeepTLF code is publicly available.
引用
收藏
页码:85 / 100
页数:16
相关论文
共 61 条
[1]  
Akrami H., 2020, ARXIV
[2]  
[Anonymous], 2015, Intelligent Systems Reference Library, DOI DOI 10.1007/978-3-319-10247-4
[3]  
Arik SO, 2019, ARXIV
[4]  
Ballet V., 2019, ARXIV
[5]   TFX: A TensorFlow-Based Production-Scale Machine Learning Platform [J].
Baylor, Denis ;
Breck, Eric ;
Cheng, Heng-Tze ;
Fiedel, Noah ;
Foo, Chuan Yu ;
Haque, Zakaria ;
Haykal, Salem ;
Ispir, Mustafa ;
Jain, Vihan ;
Koc, Levent ;
Koo, Chiu Yuen ;
Lew, Lukasz ;
Mewald, Clemens ;
Modi, Akshay Naresh ;
Polyzotis, Neoklis ;
Ramesh, Sukriti ;
Roy, Sudip ;
Whang, Steven Euijong ;
Wicke, Martin ;
Wilkiewicz, Jarek ;
Zhang, Xin ;
Zinkevich, Martin .
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, :1387-1395
[6]  
Bergstra J., 2011, P 2011 ANN C NEURAL, V24, DOI DOI 10.5555/2986459.2986743
[7]  
Borisov V., ARXIV
[8]   Early, intermediate and late fusion strategies for robust deep learning-based multimodal action recognition [J].
Boulahia, Said Yacine ;
Amamra, Abdenour ;
Madi, Mohamed Ridha ;
Daikh, Said .
MACHINE VISION AND APPLICATIONS, 2021, 32 (06)
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]  
Brooks N., 2018, WOMENS E COMMERCE CL