Deep reinforcement learning for imbalanced classification

被引:137
作者
Lin, Enlu [1 ]
Chen, Qiong [1 ]
Qi, Xiaoming [1 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
关键词
Imbalanced classification; Deep reinforcement learning; Reward function; Classification policy;
D O I
10.1007/s10489-020-01637-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data in real-world application often exhibit skewed class distribution which poses an intense challenge for machine learning. Conventional classification algorithms are not effective in case of imbalanced data distribution, and may fail when the data distribution is highly imbalanced. To address this issue, we propose a general imbalanced classification model based on deep reinforcement learning, in which we formulate the classification problem as a sequential decision-making process and solve it by a deep Q-learning network. In our model, the agent performs a classification action on one sample in each time step, and the environment evaluates the classification action and returns a reward to the agent. The reward from the minority class sample is larger, so the agent is more sensitive to the minority class. The agent finally finds an optimal classification policy in imbalanced data under the guidance of the specific reward function and beneficial simulated environment. Experiments have shown that our proposed model outperforms other imbalanced classification algorithms, and identifies more minority samples with better classification performance.
引用
收藏
页码:2488 / 2502
页数:15
相关论文
共 44 条
[1]  
Abdi L., 2014, P 3 INT C SOFT COMP, P589
[2]   Balanced undersampling: a novel sentence-based undersampling method to improve recognition of named entities in chemical and biomedical text [J].
Akkasi, Abbas ;
Varoglu, Ekrem ;
Dimililer, Nazife .
APPLIED INTELLIGENCE, 2018, 48 (08) :1965-1978
[3]  
[Anonymous], 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence
[4]  
[Anonymous], 2017, ARXIV171107364
[5]  
Batista G. E. A. P. A., 2004, ACM SIGKDD Explor Newsl, V6, P20, DOI [10.1145/1007730.1007735, DOI 10.1145/1007730.1007735]
[6]  
Benavoli A, 2016, J MACH LEARN RES, V17
[7]  
Bengio Yoshua, 2012, Neural Networks: Tricks of the Trade. Second Edition: LNCS 7700, P437, DOI 10.1007/978-3-642-35289-8_26
[8]   Decision threshold adjustment in class prediction [J].
Chen, J. J. ;
Tsai, C. -A. ;
Moon, H. ;
Ahn, H. ;
Young, J. J. ;
Chen, C. -H. .
SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2006, 17 (03) :337-352
[9]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[10]  
Dixit A. K., 1990, Optimization in economic theory, V2nd