Realization of Random Forest for Real-Time Evaluation through Tree Framing

被引:22
作者
Buschjaeger, Sebastian [1 ]
Chen, Kuan-Hsun [2 ]
Chen, Jian-Jia [2 ]
Morik, Katharina [1 ]
机构
[1] TU Dortmund Univ, Artificial Intelligence Unit, Dortmund, Germany
[2] TU Dortmund Univ, Design Automat Embedded Syst Grp, Dortmund, Germany
来源
2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM) | 2018年
关键词
random forest; decision trees; caching; computer architecture;
D O I
10.1109/ICDM.2018.00017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The optimization of learning has always been of particular concern for big data analytics. However, the ongoing integration of machine learning models into everyday life also demand the evaluation to be extremely fast and in real-time. Moreover, in the Internet of Things, the computing facilities that run the learned model are restricted. Hence, the implementation of the model application must take the characteristics of the executing platform into account Although there exist some heuristics that optimize the code, principled approaches for fast execution of learned models are rare. In this paper, we introduce a method that optimizes the execution of Decision Trees (DT). Decision Trees form the basis of many ensemble methods, such as Random Forests (RF) or Extremely Randomized Trees (ET). For these methods to work best, trees should be as large as possible. This challenges the data and the instruction cache of modern CPUs and thus demand a more careful memory layout. Based on a probabilistic view of decision tree execution, we optimize the two most common implementation schemes of decision trees. We discuss the advantages and disadvantages of both implementations and present a theoretically well-founded memory layout which maximizes locality during execution in both cases. The method is applied to three computer architectures, namely ARM (RISC), PPC (Extended RISC) and Intel (CISC) and is automatically adopted to the specific architecture by a code generator. We perform over 1800 experiments on several real-world data sets and report an average speed-up of 2 to 4 across all three architectures by using the proposed memory layout. Moreover, we find that our implementation outperforms sklearn, which was used to train the models by a factor of 1500.
引用
收藏
页码:19 / 28
页数:10
相关论文
共 50 条
  • [41] Real-time pattern recognition in statistical process control: a hybrid neural network/decision tree-based approach
    Guh, RS
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART B-JOURNAL OF ENGINEERING MANUFACTURE, 2005, 219 (03) : 283 - 298
  • [42] Construction and evaluation of prognostic models of ECMO in elderly patients with cardiogenic shock based on BP neural network, random forest, and decision tree
    Mo, Zucong
    Lu, Zheng
    Tang, Xiaogang
    Lin, Xuezhen
    Wang, Shuangquan
    Zhang, Yunli
    Huang, Zhai
    AMERICAN JOURNAL OF TRANSLATIONAL RESEARCH, 2023, 15 (07): : 4639 - 4648
  • [43] Real-time realization of Dynamic Programming using machine learning methods for IC engine waste heat recovery system power optimization
    Xu, Bin
    Rathod, Dhruvang
    Yebi, Adamu
    Filipi, Zoran
    APPLIED ENERGY, 2020, 262
  • [44] Cloud-Native Fog Robotics: Model-Based Deployment and Evaluation of Real-Time Applications
    Wen, Long
    Zhang, Yu
    Rickert, Markus
    Lin, Jianjie
    Pan, Fengjunjie
    Knoll, Alois
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (01): : 398 - 405
  • [45] Real-time EEG-based emotion recognition for neurohumanities: perspectives from principal component analysis and tree-based algorithms
    Blanco-Rios, Miguel Alejandro
    Candela-Leal, Milton Osiel
    Orozco-Romo, Cecilia
    Remis-Serna, Paulina
    Velez-Saboya, Carol Stefany
    Lozoya-Santos, Jorge de Jesus
    Cebral-Loureda, Manuel
    Ramirez-Moreno, Mauricio Adolfo
    FRONTIERS IN HUMAN NEUROSCIENCE, 2024, 18
  • [46] Evaluation of crop mapping on fragmented and complex slope farmlands through random forest and object-oriented analysis using unmanned aerial vehicles
    Lee, Re-Yang
    Chang, Kuo-Chen
    Ou, Deng-Yuan
    Hsu, Chia-Hui
    GEOCARTO INTERNATIONAL, 2020, 35 (12) : 1293 - 1310
  • [47] Real-Time Radiofrequency Ablation Lesion Depth Estimation Using Multi-frequency Impedance With a Deep Neural Network and Tree-Based Ensembles
    Besler, Emre
    Wang, Yearnchee Curtis
    Sahakian, Alan V.
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2020, 67 (07) : 1890 - 1899
  • [49] A Novel 5G-NR Resources Partitioning Framework Through Real-Time User-Provider Traffic Demand Analysis
    Maule, Massimiliano
    Vardakas, John S.
    Verikoukis, Christos
    IEEE SYSTEMS JOURNAL, 2022, 16 (04): : 5317 - 5328
  • [50] An Edge Computing-Based Factor-Aware Novel Framework for Early Detection and Classification of Melanoma Disease Through a Customized VGG16 Architecture With Privacy Preservation and Real-Time Analysis
    Almufareh, Maram Fahaad
    IEEE ACCESS, 2024, 12 : 113580 - 113596