RETRACTED: Three-way selection random forest algorithm based on decision boundary entropy (Retracted Article)

被引:15
作者
Zhang, Chunying [1 ,2 ]
Ren, Jing [1 ]
Liu, Fengchun [3 ]
Li, Xiaoqi [1 ]
Liu, Shouyue [1 ]
机构
[1] North China Univ Sci & Technol, Coll Sci, Tangshan 063210, Hebei, Peoples R China
[2] Key Lab Data Sci & Applicat Hebei Prov, Tangshan 063210, Hebei, Peoples R China
[3] North China Univ Sci & Technol, Coll Qianan, Tangshan 063210, Hebei, Peoples R China
关键词
Random Forest; Attribute Selection; Decision Boundary Entropy; Significance of Attribute; Three-way Decision; ATTRIBUTES;
D O I
10.1007/s10489-021-03033-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Aiming at the problem of high probability of negative impact about redundant attributes in random forest algorithms, a Three-way Selection Random Forest algorithm based on decision boundary entropy (TSRF) is proposed without losing randomness and reducing the influence of redundant attributes on decision-making results. According to the characteristics of the attribute, the concept of decision boundary entropy is defined. Then a measuring method of attribute importance based on decision boundary entropy is proposed and set as an evaluation standard. Three-way decision is constructed and the attribute is divided into three candidate domains, namely positive domain, negative domain and boundary domain. In order to ensure the randomness of attributes, three-way attribute random selection rules based on attribute randomness are established and a certain number of attributes are randomly selected from the three candidate domains. Combine the samples selected by the bootstrap sampling method with attribute sets selected by three-way decision to produce training sample sets so that we can train the decision trees and generate forest. Six datasets are selected for the experiment. Two parameters of attribute randomness and three-way decision thresholds are analyzed to verify the theoretical conclusions respectively. The results show that the TSRF algorithm can meet the different requirements of different data sets by adjusting the parameters. The classification effect on the binary data is basically the same as the comparison algorithm, but TSRF has a significant improvement effect on the multi-class data compared with other algorithms. The proposed TSRF algorithm widens the idea for the measurement method of significance of attribute, innovates the random forest three-way selection integration method, and provides a better model framework for solving multi-classification problems.
引用
收藏
页码:13384 / 13397
页数:14
相关论文
共 50 条
[41]   A general conflict analysis model based on three-way decision [J].
Guangming Lang .
International Journal of Machine Learning and Cybernetics, 2020, 11 :1083-1094
[42]   Effectiveness measure in change-based three-way decision [J].
Jiang, Chunmao ;
Duan, Ying ;
Guo, Doudou .
SOFT COMPUTING, 2023, 27 (06) :2783-2793
[43]   An active learning method based on three-way decision model [J].
Hu F. ;
Zhang M. ;
Yu H. .
Kongzhi yu Juece/Control and Decision, 2019, 34 (04) :718-726
[44]   Normal Fuzzy Three-Way Decision Based on Prospect Theory [J].
Li, Yanhua ;
Liu, Jiafen ;
Zhong, Yihua ;
Yang, Xin .
ROUGH SETS, IJCRS 2023, 2023, 14481 :463-478
[45]   Three-way decision with incomplete information based on similarity and satisfiability [J].
Luo, Junfang ;
Hu, Mengjun ;
Qin, Keyun .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2020, 120 :151-183
[46]   Effectiveness measure in change-based three-way decision [J].
Chunmao Jiang ;
Ying Duan ;
Doudou Guo .
Soft Computing, 2023, 27 :2783-2793
[47]   A prospect theory-based three-way decision model [J].
Wang, Tianxing ;
Li, Huaxiong ;
Zhou, Xianzhong ;
Huang, Bing ;
Zhu, Haibin .
KNOWLEDGE-BASED SYSTEMS, 2020, 203
[48]   A general conflict analysis model based on three-way decision [J].
Lang, Guangming .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (05) :1083-1094
[49]   Generalized three-way decision models based on subset evaluation [J].
Li, Xiaonan ;
Yi, Huangjian ;
She, Yanhong ;
Sun, Bingzhen .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2017, 83 :142-159
[50]   Random Decision DAG: An Entropy Based Compression Approach for Random Forest [J].
Liu, Xin ;
Liu, Xiao ;
Lai, Yongxuan ;
Yang, Fan ;
Zeng, Yifeng .
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 :319-323