Research on a Decision Tree Classification Algorithm Based on Granular Matrices

被引:2
作者
Meng, Lijuan [1 ]
Bai, Bin [1 ,2 ,3 ,4 ]
Zhang, Wenda [5 ]
Liu, Lu [1 ,2 ,3 ,4 ,6 ]
Zhang, Chunying [1 ,2 ,3 ,4 ,6 ]
机构
[1] North China Univ Sci & Technol, Coll Sci, Tangshan 063210, Peoples R China
[2] North China Univ Sci & Technol, Hebei Engn Res Ctr Intelligentizat Iron Ore Optimi, Tangshan 063210, Peoples R China
[3] North China Univ Sci & Technol, Hebei Key Lab Data Sci & Applicat, Tangshan 063210, Peoples R China
[4] North China Univ Sci & Technol, Key Lab Engn Comp Tangshan City, Tangshan 063210, Peoples R China
[5] North China Univ Sci & Technol, Coll Min Engn, Tangshan 063210, Peoples R China
[6] North China Univ Sci & Technol, Tangshan Intelligent Ind & Image Proc Technol Inno, Tangshan 063210, Peoples R China
关键词
classification; decision tree; granular computing; granular structure; granular matrix; similarity metric matrix; classification accuracy; FEATURE-SELECTION; FUZZY;
D O I
10.3390/electronics12214470
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The decision tree is one of the most important and representative classification algorithms in the field of machine learning, and it is an important technique for solving data mining classification tasks. In this paper, a decision tree classification algorithm based on granular matrices is proposed on the basis of granular computing theory. Firstly, the bit-multiplication and bit-sum operations of granular matrices are defined. The logical operations between granules are replaced by simple multiplication and addition operations, which reduces the operation time. Secondly, the similarity between granules is defined, the similarity metric matrix of the granular space is constructed, the classification actions are extracted from the similarity metric matrix, and the classification accuracy is defined by weighting the classification actions with the probability distribution of the granular space. Finally, the classification accuracy of the conditional attribute is used to select the splitting attributes of the decision tree as the nodes to form forks in the tree, and the similarity between granules is used to judge whether the data types in the sub-datasets are consistent to form the leaf nodes. The feasibility of the algorithm is demonstrated by means of case studies. The results of tests conducted on six UCI public datasets show that the algorithm has higher classification accuracy and better classification performance than the ID3 and C4.5.
引用
收藏
页数:14
相关论文
共 36 条
  • [21] Feature selection based on rough set approach, wrapper approach, and binary whale optimization algorithm
    Tawhid, Mohamed A.
    Ibrahim, Abdelmonem M.
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (03) : 573 - 602
  • [22] Wang R., 2017, Comput. Sci, V44, P129
  • [23] Wang Y., 2021, Comput. Appl, V41, P2785
  • [24] Wei H.N., 2005, Comput. Appl, P39
  • [25] Wu J., 2017, Comput. Sci, V44, P407
  • [26] Wu Si-bo, 2016, Computer Engineering and Science, V38, P2342, DOI 10.3969/j.issn.1007-130X.2016.11.028
  • [27] Xie xin, 2022, Computer Engineering and Applications, P139, DOI 10.3778/j.issn.1002-8331.2011-0040
  • [28] Yang J., 2020, J. Intell. Syst, V15, P166
  • [29] [杨洁 Yang Jie], 2018, [模式识别与人工智能, Pattern Recognition and Artificial Intelligence], V31, P677
  • [30] A characterization of novel rough fuzzy sets of information systems and their application in decision making
    Yu, Bin
    Guo, Lankun
    Li, Qingguo
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 122 : 253 - 261