Research on a Decision Tree Classification Algorithm Based on Granular Matrices

被引:2
作者
Meng, Lijuan [1 ]
Bai, Bin [1 ,2 ,3 ,4 ]
Zhang, Wenda [5 ]
Liu, Lu [1 ,2 ,3 ,4 ,6 ]
Zhang, Chunying [1 ,2 ,3 ,4 ,6 ]
机构
[1] North China Univ Sci & Technol, Coll Sci, Tangshan 063210, Peoples R China
[2] North China Univ Sci & Technol, Hebei Engn Res Ctr Intelligentizat Iron Ore Optimi, Tangshan 063210, Peoples R China
[3] North China Univ Sci & Technol, Hebei Key Lab Data Sci & Applicat, Tangshan 063210, Peoples R China
[4] North China Univ Sci & Technol, Key Lab Engn Comp Tangshan City, Tangshan 063210, Peoples R China
[5] North China Univ Sci & Technol, Coll Min Engn, Tangshan 063210, Peoples R China
[6] North China Univ Sci & Technol, Tangshan Intelligent Ind & Image Proc Technol Inno, Tangshan 063210, Peoples R China
关键词
classification; decision tree; granular computing; granular structure; granular matrix; similarity metric matrix; classification accuracy; FEATURE-SELECTION; FUZZY;
D O I
10.3390/electronics12214470
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The decision tree is one of the most important and representative classification algorithms in the field of machine learning, and it is an important technique for solving data mining classification tasks. In this paper, a decision tree classification algorithm based on granular matrices is proposed on the basis of granular computing theory. Firstly, the bit-multiplication and bit-sum operations of granular matrices are defined. The logical operations between granules are replaced by simple multiplication and addition operations, which reduces the operation time. Secondly, the similarity between granules is defined, the similarity metric matrix of the granular space is constructed, the classification actions are extracted from the similarity metric matrix, and the classification accuracy is defined by weighting the classification actions with the probability distribution of the granular space. Finally, the classification accuracy of the conditional attribute is used to select the splitting attributes of the decision tree as the nodes to form forks in the tree, and the similarity between granules is used to judge whether the data types in the sub-datasets are consistent to form the leaf nodes. The feasibility of the algorithm is demonstrated by means of case studies. The results of tests conducted on six UCI public datasets show that the algorithm has higher classification accuracy and better classification performance than the ID3 and C4.5.
引用
收藏
页数:14
相关论文
共 36 条
  • [1] SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation
    Blewitt, Marnie E.
    Gendrel, Anne-Valerie
    Pang, Zhenyi
    Sparrow, Duncan B.
    Whitelaw, Nadia
    Craig, Jeffrey M.
    Apedaile, Anwyn
    Hilton, Douglas J.
    Dunwoodie, Sally L.
    Brockdorff, Neil
    Kay, Graham F.
    Whitelaw, Emma
    [J]. NATURE GENETICS, 2008, 40 (05) : 663 - 669
  • [2] Bujnowski P, 2015, ADV INTEL SYS RES, V89, P1253
  • [3] Fuzzy SLIQ decision tree algorithm
    Chandra, B.
    Varghese, P. Paul
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (05): : 1294 - 1301
  • [4] Chen C.E., 2018, J. Northwest Norm. Univ, V54, P11
  • [5] Chen JH, 2009, IEEE RAD FREQ INTEGR, P127, DOI 10.1109/ICCSE.2009.5228509
  • [6] Dynamic Nonparametric Random Forest Using Covariance
    Choi, Seok-Hwan
    Shin, Jin-Myeong
    Choi, Yoon-Ho
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2019, 2019
  • [7] Fu C., 2021, Ph.D. Thesis
  • [8] Honglei G., 2022, J. Natl. Univ. Def. Technol, V44, P67
  • [9] Application of fuzzy decision tree in EOR screening assessment
    Khazali, Nastaran
    Sharifi, Mohammad
    Ahmadi, Mohammad Ali
    [J]. JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2019, 177 : 167 - 180
  • [10] Kozak J, 2019, STUD COMPUT INTELL, V781, P1, DOI 10.1007/978-3-319-93752-6