Learning Hierarchical Multi-label Classification Trees from Network Data

被引:0
|
作者
Stojanova, Daniela [1 ]
Ceci, Michelangelo [2 ]
Malerba, Donato [2 ]
Dzeroski, Saso [1 ,3 ,4 ]
机构
[1] Jozef Stefan Inst, Dept Knowledge Technol, Ljubljana, Slovenia
[2] Univ Bari, Dipartimento Informat, Bari, Italy
[3] Jozef Stefan Int Postgrad Sch, Ljubljana, Slovenia
[4] COE, Integrated Approaches Chem & Biol Proteins, Proteins, Slovakia
来源
DISCOVERY SCIENCE | 2013年 / 8140卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present an algorithm for hierarchical multi-label classification (HMC) in a network context. It is able to classify instances that may belong to multiple classes at the same time and consider the hierarchical organization of the classes. It assumes that the instances are placed in a network and uses information on the network connections during the learning of the predictive model. Many real world prediction problems have classes that are organized hierarchically and instances that can have pairwise connections. One example is web document classification, where topics (classes) are typically organized into a hierarchy and documents are connected by hyperlinks. Another example, which is considered in this paper, is gene/protein function prediction, where genes/proteins are connected and form protein-to-protein interaction (PPI) networks. Network datasets are characterized by a form of autocorrelation, where the value of a variable at a given node depends on the values of variables at the nodes it is connected with. Combining the hierarchical multi-label classification task with network prediction is thus not trivial and requires the introduction of the new concept of network autocorrelation for HMC. The proposed algorithm is able to profitably exploit network autocorrelation when learning a tree-based prediction model for HMC. The learned model is in the form of a Predictive Clustering Tree (PCT) and predicts multiple (hierarchically organized) labels at the leaves. Experiments show the effectiveness of the proposed approach for different problems of gene function prediction, considering different PPI networks. The results show that different networks introduce different benefits in different problems of gene function prediction.
引用
收藏
页码:233 / 248
页数:16
相关论文
共 50 条
  • [41] On active learning in multi-label classification
    Brinker, K
    FROM DATA AND INFORMATION ANALYSIS TO KNOWLEDGE ENGINEERING, 2006, : 206 - 213
  • [42] Learning multi-label scene classification
    Boutell, MR
    Luo, JB
    Shen, XP
    Brown, CM
    PATTERN RECOGNITION, 2004, 37 (09) : 1757 - 1771
  • [43] Hierarchical multi-instance multi-label learning for Chinese patent text classification
    Liu, Yunduo
    Xu, Fang
    Zhao, Yushan
    Ma, Zichen
    Wang, Tengke
    Zhang, Shunxiang
    Tian, Yuhao
    CONNECTION SCIENCE, 2024, 36 (01)
  • [44] Hierarchical building use classification from multiple modalities with a multi-label multimodal transformer network
    Zhou, Wen
    Persello, Claudio
    Stein, Alfred
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 132
  • [45] Web Genre Classification via Hierarchical Multi-label Classification
    Madjarov, Gjorgji
    Vidulin, Vedrana
    Dimitrovski, Ivica
    Kocev, Dragi
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2015, 2015, 9375 : 9 - 17
  • [46] MultiPep: a hierarchical deep learning approach for multi-label classification of peptide bioactivities
    Gronning, Alexander G. B.
    Kacprowski, Tim
    Scheele, Camilla
    BIOLOGY METHODS & PROTOCOLS, 2021, 6 (01): : 1 - 16
  • [47] Hierarchical multi-label taxonomic classification of carbonate skeletal grains with deep learning
    Ho, Madison
    Idgunji, Sidhant
    Payne, Jonathan L.
    Koeshidayatullah, Ardiansyah
    SEDIMENTARY GEOLOGY, 2023, 443
  • [48] CEHMR: Curriculum learning enhanced hierarchical multi-label classification for medication recommendation
    Sun, Mengxuan
    Niu, Jinghao
    Yang, Xuebing
    Gu, Yifan
    Zhang, Wensheng
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2023, 143
  • [49] Surj: Ontological Learning for Fast, Accurate, and Robust Hierarchical Multi-label Classification
    Yang, Sean T.
    Howe, Bill
    COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2022, WWW 2022 COMPANION, 2022, : 1106 - 1114
  • [50] Self-Paced Unified Representation Learning for Hierarchical Multi-Label Classification
    Yuan, Zixuan
    Liu, Hao
    Zhou, Haoyi
    Zhang, Denghui
    Zhang, Xiao
    Wang, Hao
    Xiong, Hui
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15, 2024, : 16623 - 16632