Learning Hierarchical Multi-label Classification Trees from Network Data

被引:0
|
作者
Stojanova, Daniela [1 ]
Ceci, Michelangelo [2 ]
Malerba, Donato [2 ]
Dzeroski, Saso [1 ,3 ,4 ]
机构
[1] Jozef Stefan Inst, Dept Knowledge Technol, Ljubljana, Slovenia
[2] Univ Bari, Dipartimento Informat, Bari, Italy
[3] Jozef Stefan Int Postgrad Sch, Ljubljana, Slovenia
[4] COE, Integrated Approaches Chem & Biol Proteins, Proteins, Slovakia
来源
DISCOVERY SCIENCE | 2013年 / 8140卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present an algorithm for hierarchical multi-label classification (HMC) in a network context. It is able to classify instances that may belong to multiple classes at the same time and consider the hierarchical organization of the classes. It assumes that the instances are placed in a network and uses information on the network connections during the learning of the predictive model. Many real world prediction problems have classes that are organized hierarchically and instances that can have pairwise connections. One example is web document classification, where topics (classes) are typically organized into a hierarchy and documents are connected by hyperlinks. Another example, which is considered in this paper, is gene/protein function prediction, where genes/proteins are connected and form protein-to-protein interaction (PPI) networks. Network datasets are characterized by a form of autocorrelation, where the value of a variable at a given node depends on the values of variables at the nodes it is connected with. Combining the hierarchical multi-label classification task with network prediction is thus not trivial and requires the introduction of the new concept of network autocorrelation for HMC. The proposed algorithm is able to profitably exploit network autocorrelation when learning a tree-based prediction model for HMC. The learned model is in the form of a Predictive Clustering Tree (PCT) and predicts multiple (hierarchically organized) labels at the leaves. Experiments show the effectiveness of the proposed approach for different problems of gene function prediction, considering different PPI networks. The results show that different networks introduce different benefits in different problems of gene function prediction.
引用
收藏
页码:233 / 248
页数:16
相关论文
共 50 条
  • [21] The importance of the label hierarchy in hierarchical multi-label classification
    Levatic, Jurica
    Kocev, Dragi
    Dzeroski, Saso
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2015, 45 (02) : 247 - 271
  • [22] The importance of the label hierarchy in hierarchical multi-label classification
    Jurica Levatić
    Dragi Kocev
    Sašo Džeroski
    Journal of Intelligent Information Systems, 2015, 45 : 247 - 271
  • [23] Label Correction Strategy on Hierarchical Multi-Label Classification
    Ananpiriyakul, Thanawut
    Poomsirivilai, Piyapan
    Vateekul, Peerapon
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, MLDM 2014, 2014, 8556 : 213 - 227
  • [24] Dependency Network Methods for Hierarchical Multi-label Classification of Gene Functions
    Fabris, Fabio
    Freitas, Alex A.
    2014 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING (CIDM), 2014, : 241 - 248
  • [25] A deep neural network based hierarchical multi-label classification method
    Feng, Shou
    Zhao, Chunhui
    Fu, Ping
    REVIEW OF SCIENTIFIC INSTRUMENTS, 2020, 91 (02):
  • [26] Deep neural network for hierarchical extreme multi-label text classification
    Gargiulo, Francesco
    Silvestri, Stefano
    Ciampi, Mario
    De Pietro, Giuseppe
    APPLIED SOFT COMPUTING, 2019, 79 : 125 - 138
  • [27] Cognitive structure learning model for hierarchical multi-label text classification
    Wang, Boyan
    Hu, Xuegang
    Li, Peipei
    Yu, Philip S.
    KNOWLEDGE-BASED SYSTEMS, 2021, 218
  • [28] Applying semi-supervised learning in hierarchical multi-label classification
    Santos, Araken
    Canuto, Anne
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (14) : 6075 - 6085
  • [29] Hierarchical Multi-label Classification using Fully Associative Ensemble Learning
    Zhang, L.
    Shah, S. K.
    Kakadiaris, I. A.
    PATTERN RECOGNITION, 2017, 70 : 89 - 103
  • [30] Cost-Effective Active Learning for Hierarchical Multi-Label Classification
    Yan, Yi-Fan
    Huang, Sheng-Jun
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2962 - 2968