Building Decision Forest via Deep Reinforcement Learning

被引:0
作者
Hua, Hongzhi [1 ]
Wen, Guixuan [1 ]
Wu, Kaigui [1 ]
机构
[1] Chongqing Univ, Coll Comp Sci, Chongqing, Peoples R China
来源
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN | 2023年
关键词
multi-agent deep reinforcement learning; ensemble learning; decision tree;
D O I
10.1109/IJCNN54540.2023.10191160
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ensemble learning methods whose base classifier is a decision tree usually belong to the bagging or boosting. It is widely used in all aspects of machine learning and has made great achievements in classification problems. However, no previous work has ever built the ensemble classifier by maximizing long-term returns to the best of our knowledge. This paper proposes a decision forest building method called MA-HSAC-DF (Multi-agent Hybrid Soft Actor Critic based Decision Forest) for binary classification via deep reinforcement learning. First, the building process is modeled as a decentralized partial observable Markov decision process, and a set of cooperative agents jointly constructs all base classifiers. Second, the global state and local observations are defined based on information of the parent node and the current location. Last, the state-ofthe-art deep reinforcement method Hybrid SAC (Hybrid Soft Actor Critic) with hybrid action space is extended to a multiagent system under the CTDE (centralized training decentralized execution) architecture to find an optimal decision forest building policy. The experiments indicate that MA-H-SAC-DF has the same performance as random forest, Adaboost, and GBDT (Gradient Boosting Decision Tree) on balanced datasets and outperforms state-of-the-art ensemble learning algorithms on imbalanced datasets.
引用
收藏
页数:8
相关论文
共 25 条
  • [11] Fujimoto Scott, ICML 2018, P1582
  • [12] Karakoulas Grigoris I., 1998, NIPS 1998, P253
  • [13] Predicting restaurant financial distress using decision tree and AdaBoosted decision tree models
    Kim, Soo Y.
    Upneja, Arun
    [J]. ECONOMIC MODELLING, 2014, 36 : 354 - 362
  • [14] Koyuncugil A. Serhan, 2009, ICCES, V11, P39
  • [15] Liu XY, 2006, IEEE DATA MINING, P965
  • [16] Self-paced Ensemble for Highly Imbalanced Massive Data Classification
    Liu, Zhining
    Cao, Wei
    Gao, Zhifeng
    Bian, Jiang
    Chen, Hechang
    Chang, Yi
    Liu, Tie-Yan
    [J]. 2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 841 - 852
  • [17] Lowe Ryan, 2017, NIPS 2017, P6379
  • [18] Direct marketing decision support through predictive customer response modeling
    Olson, David L.
    Chae, Bongsug
    [J]. DECISION SUPPORT SYSTEMS, 2012, 54 (01) : 443 - 451
  • [19] Rashid Tabish, 2018, ICML 2018, P4292
  • [20] Ensemble learning: A survey
    Sagi, Omer
    Rokach, Lior
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 8 (04)