A top-down supervised learning approach to hierarchical multi-label classification in networks

被引:0
作者
Miguel Romero
Jorge Finke
Camilo Rocha
机构
[1] Pontificia Universidad Javeriana,Department of Electronics and Computer Science
来源
Applied Network Science | / 7卷
关键词
Hierarchical classification; Supervised learning; XGBoost; Top-down approach; Gene function prediction;
D O I
暂无
中图分类号
学科分类号
摘要
Node classification is the task of inferring or predicting missing node attributes from information available for other nodes in a network. This paper presents a general prediction model to hierarchical multi-label classification, where the attributes to be inferred can be specified as a strict poset. It is based on a top-down classification approach that addresses hierarchical multi-label classification with supervised learning by building a local classifier per class. The proposed model is showcased with a case study on the prediction of gene functions for Oryza sativa Japonica, a variety of rice. It is compared to the Hierarchical Binomial-Neighborhood, a probabilistic model, by evaluating both approaches in terms of prediction performance and computational cost. The results in this work support the working hypothesis that the proposed model can achieve good levels of prediction efficiency, while scaling up in relation to the state of the art.
引用
收藏
相关论文
共 107 条
  • [11] Harris MA(2018)ATTED-II in 2018: a plant coexpression database based on investigation of the statistical property of the mutual rank index Plant Cell Physiol 59 3-215
  • [12] Hill DP(2008)Conserved co-expression for candidate disease gene prioritization BMC Bioinform 9 208-6
  • [13] Issel-Tarver L(2012)A survey and current research challenges in multi-label classification methods Int J Soft Comput Eng (IJSCE) 2 248-72
  • [14] Kasarskis A(2016)Hierarchical multilabel classification based on path evaluation Int J Approx Reason 68 179-546
  • [15] Lewis S(2019)Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead Nat Mach Intell 1 206-12788
  • [16] Matese JC(2013)Rice annotation project database (RAP-DB): an integrative and interactive database for rice genomics Plant Cell Physiol 54 6-undefined
  • [17] Richardson JE(2016)Learning from co-expression networks: possibilities and challenges Front Plant Sci 22 31-undefined
  • [18] Ringwald M(2011)A survey of hierarchical classification across different application domains Data Min Knowl Disc 150 535-undefined
  • [19] Rubin GM(2017)Gene co-expression analysis for functional classification and gene-disease predictions Brief Bioinform 33 4-undefined
  • [20] Sherlock G(2009)Unraveling transcriptional control in arabidopsis using cis-regulatory elements and coexpression networks Plant Physiol 99 12783-undefined