Building Decision Forest via Deep Reinforcement Learning

被引：0

作者：

Hua, Hongzhi ^{[1
]}

Wen, Guixuan ^{[1
]}

Wu, Kaigui ^{[1
]}

机构：

[1] Chongqing Univ, Coll Comp Sci, Chongqing, Peoples R China

来源：

2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN | 2023年

关键词：

multi-agent deep reinforcement learning; ensemble learning; decision tree;

D O I：

10.1109/IJCNN54540.2023.10191160

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Ensemble learning methods whose base classifier is a decision tree usually belong to the bagging or boosting. It is widely used in all aspects of machine learning and has made great achievements in classification problems. However, no previous work has ever built the ensemble classifier by maximizing long-term returns to the best of our knowledge. This paper proposes a decision forest building method called MA-HSAC-DF (Multi-agent Hybrid Soft Actor Critic based Decision Forest) for binary classification via deep reinforcement learning. First, the building process is modeled as a decentralized partial observable Markov decision process, and a set of cooperative agents jointly constructs all base classifiers. Second, the global state and local observations are defined based on information of the parent node and the current location. Last, the state-ofthe-art deep reinforcement method Hybrid SAC (Hybrid Soft Actor Critic) with hybrid action space is extended to a multiagent system under the CTDE (centralized training decentralized execution) architecture to find an optimal decision forest building policy. The experiments indicate that MA-H-SAC-DF has the same performance as random forest, Adaboost, and GBDT (Gradient Boosting Decision Tree) on balanced datasets and outperforms state-of-the-art ensemble learning algorithms on imbalanced datasets.

引用

页数：8

共 25 条

[11] Fujimoto Scott, ICML 2018, P1582
[12] Karakoulas Grigoris I., 1998, NIPS 1998, P253
[13] Predicting restaurant financial distress using decision tree and AdaBoosted decision tree models
Kim, Soo Y.
Upneja, Arun
[J]. ECONOMIC MODELLING, 2014, 36 : 354 - 362
[14] Koyuncugil A. Serhan, 2009, ICCES, V11, P39
[15] Liu XY, 2006, IEEE DATA MINING, P965
[16] Self-paced Ensemble for Highly Imbalanced Massive Data Classification
Liu, Zhining
Cao, Wei
Gao, Zhifeng
Bian, Jiang
Chen, Hechang
Chang, Yi
Liu, Tie-Yan
[J]. 2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 841 - 852
[17] Lowe Ryan, 2017, NIPS 2017, P6379
[18] Direct marketing decision support through predictive customer response modeling
Olson, David L.
Chae, Bongsug
[J]. DECISION SUPPORT SYSTEMS, 2012, 54 (01) : 443 - 451
[19] Rashid Tabish, 2018, ICML 2018, P4292
[20] Ensemble learning: A survey
Sagi, Omer
Rokach, Lior
[J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 8 (04)

← 1 2 3 →