Mutual information-based multi-output tree learning algorithm

被引:0
|
作者
Kang, Hyun-Seok [1 ,2 ]
Jun, Chi-Hyuck [3 ]
机构
[1] POSCO, Tech Res Labs, Pohang, South Korea
[2] Pohang Univ Sci & Technol POSTECH, Grad Inst Ferrous Technol, Pohang, South Korea
[3] Pohang Univ Sci & Technol POSTECH, Dept Ind & Management Engn, 77 Cheongam Ro, Pohang 37859, Gyeongbuk, South Korea
基金
新加坡国家研究基金会;
关键词
Machine learning; mutual information; variable selection; multi-output tree; large data sets; MULTIVARIATE REGRESSION TREES; CLASSIFICATION; SELECTION; IDENTIFICATION; SUBGROUPS;
D O I
10.3233/IDA-205367
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A tree model with low time complexity can support the application of artificial intelligence to industrial systems. Variable selection based tree learning algorithms are more time efficient than existing Classification and Regression Tree (CART) algorithms. To our best knowledge, there is no attempt to deal with categorical input variable in variable selection based multi-output tree learning. Also, in the case of multi-output regression tree, a conventional variable selection based algorithm is not suitable to large datasets. We propose a mutual information-based multi-output tree learning algorithm that consists of variable selection and split optimization. The proposed method discretizes each variable based on k-means into 2-4 clusters and selects the variable for splitting based on the discretized variables using mutual information. This variable selection component has relatively low time complexity and can be applied regardless of output dimension and types. The proposed split optimization component is more efficient than an exhaustive search. The performance of the proposed tree learning algorithm is similar to or better than that of a multi-output version of CART algorithm on a specific dataset. In addition, with a large dataset, the time complexity of the proposed algorithm is significantly reduced compared to a CART algorithm.
引用
收藏
页码:1525 / 1545
页数:21
相关论文
共 50 条
  • [1] MUTUAL INFORMATION-BASED FAIR ACTIVE LEARNING
    Sonoda, Ryosuke
    Srinivasan, Ramya
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 4965 - 4969
  • [2] Dynamic mutual information-based feature selection for multi-label learning
    Kim, Kyung-Jun
    Jun, Chi-Hyuck
    INTELLIGENT DATA ANALYSIS, 2023, 27 (04) : 891 - 909
  • [3] Modified Mutual Information-based Feature Selection for Intrusion Detection Systems in Decision Tree Learning
    Song, Jingping
    Zhu, Zhiliang
    Scully, Peter
    Price, Chris
    JOURNAL OF COMPUTERS, 2014, 9 (07) : 1542 - 1546
  • [4] Mutual information-based label distribution feature selection for multi-label learning
    Qian, Wenbin
    Huang, Jintao
    Wang, Yinglong
    Shu, Wenhao
    KNOWLEDGE-BASED SYSTEMS, 2020, 195
  • [5] MIRA: mutual information-based reporter algorithm for metabolic networks
    Cicek, A. Ercument
    Roeder, Kathryn
    Ozsoyoglu, Gultekin
    BIOINFORMATICS, 2014, 30 (12) : 175 - 184
  • [6] Survey on Multi-Output Learning
    Xu, Donna
    Shi, Yaxin
    Tsang, Ivor W.
    Ong, Yew-Soon
    Gong, Chen
    Shen, Xiaobo
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (07) : 2409 - 2429
  • [7] Regularization-based model tree for multi-output regression
    Jeong, Jun-Yong
    Kang, Ju-Seok
    Jun, Chi-Hyuck
    INFORMATION SCIENCES, 2020, 507 : 240 - 255
  • [8] ROBUST MUTUAL INFORMATION-BASED MULTI-IMAGE REGISTRATION
    Liu, Dehong
    Mansour, Hassan
    Boufounos, Petros T.
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 915 - 918
  • [9] Traffic Prediction With Transfer Learning: A Mutual Information-Based Approach
    Huang, Yunjie
    Song, Xiaozhuang
    Zhu, Yuanshao
    Zhang, Shiyao
    Yu, James J. Q.
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (08) : 8236 - 8252
  • [10] Conditional Mutual Information-Based Generalization Bound for Meta Learning
    Rezazadeh, Arezou
    Jose, Sharu Theresa
    Durisi, Giuseppe
    Simeone, Osvaldo
    2021 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2021, : 1176 - 1181