Building Decision Forest via Deep Reinforcement Learning

被引:0
作者
Hua, Hongzhi [1 ]
Wen, Guixuan [1 ]
Wu, Kaigui [1 ]
机构
[1] Chongqing Univ, Coll Comp Sci, Chongqing, Peoples R China
来源
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN | 2023年
关键词
multi-agent deep reinforcement learning; ensemble learning; decision tree;
D O I
10.1109/IJCNN54540.2023.10191160
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ensemble learning methods whose base classifier is a decision tree usually belong to the bagging or boosting. It is widely used in all aspects of machine learning and has made great achievements in classification problems. However, no previous work has ever built the ensemble classifier by maximizing long-term returns to the best of our knowledge. This paper proposes a decision forest building method called MA-HSAC-DF (Multi-agent Hybrid Soft Actor Critic based Decision Forest) for binary classification via deep reinforcement learning. First, the building process is modeled as a decentralized partial observable Markov decision process, and a set of cooperative agents jointly constructs all base classifiers. Second, the global state and local observations are defined based on information of the parent node and the current location. Last, the state-ofthe-art deep reinforcement method Hybrid SAC (Hybrid Soft Actor Critic) with hybrid action space is extended to a multiagent system under the CTDE (centralized training decentralized execution) architecture to find an optimal decision forest building policy. The experiments indicate that MA-H-SAC-DF has the same performance as random forest, Adaboost, and GBDT (Gradient Boosting Decision Tree) on balanced datasets and outperforms state-of-the-art ensemble learning algorithms on imbalanced datasets.
引用
收藏
页数:8
相关论文
共 25 条
  • [1] Al Jarullah Asma A., 2011, 2011 International Conference on Innovations in Information Technology (IIT), P303, DOI 10.1109/INNOVATIONS.2011.5893838
  • [2] Decision tree classifiers for automated medical diagnosis
    Azar, Ahmad Taher
    El-Metwally, Shereen M.
    [J]. NEURAL COMPUTING & APPLICATIONS, 2013, 23 (7-8) : 2387 - 2403
  • [3] Biau G, 2016, TEST-SPAIN, V25, P197, DOI 10.1007/s11749-016-0481-7
  • [4] SMOTEBoost: Improving prediction of the minority class in boosting
    Chawla, NV
    Lazarevic, A
    Hall, LO
    Bowyer, KW
    [J]. KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2003, PROCEEDINGS, 2003, 2838 : 107 - 119
  • [5] Cieslak DA, 2008, LECT NOTES ARTIF INT, V5211, P241, DOI 10.1007/978-3-540-87479-9_34
  • [6] Delalleau Olivier, 2019, ABS191211077 CORR
  • [7] A survey on multi-agent deep reinforcement learning: from the perspective of challenges and applications
    Du, Wei
    Ding, Shifei
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (05) : 3215 - 3238
  • [8] Fan W, 1999, MACHINE LEARNING, PROCEEDINGS, P97
  • [9] Foerster JN, 2018, AAAI CONF ARTIF INTE, P2974
  • [10] Greedy function approximation: A gradient boosting machine
    Friedman, JH
    [J]. ANNALS OF STATISTICS, 2001, 29 (05) : 1189 - 1232