Incorporation of Biological Pathway Knowledge in the Construction of Priors for Optimal Bayesian Classification

被引:32
|
作者
Esfahani, Mohammad Shahrokh [1 ,2 ]
Dougherty, Edward R. [1 ,2 ]
机构
[1] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77843 USA
[2] Texas A&M Univ, Ctr Bioinformat & Genom Syst Engn, College Stn, TX USA
关键词
Phenotype classification; biological pathway knowledge; optimal Bayesian classifier (OBC); prior probability construction; regularization; convex optimization; synthetic pathway generation; SQUARE ERROR ESTIMATION; MINIMUM EXPECTED ERROR; OPTIMAL CLASSIFIERS; INFORMATION; CANCER; UNCERTAINTY; FRAMEWORK; MODEL; DISTRIBUTIONS; DISCRETE;
D O I
10.1109/TCBB.2013.143
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Small samples are commonplace in genomic/proteomic classification, the result being inadequate classifier design and poor error estimation. The problem has recently been addressed by utilizing prior knowledge in the form of a prior distribution on an uncertainty class of feature-label distributions. A critical issue remains: how to incorporate biological knowledge into the prior distribution. For genomics/proteomics, the most common kind of knowledge is in the form of signaling pathways. Thus, it behooves us to find methods of transforming pathway knowledge into knowledge of the feature-label distribution governing the classification problem. In this paper, we address the problem of prior probability construction by proposing a series of optimization paradigms that utilize the incomplete prior information contained in pathways (both topological and regulatory). The optimization paradigms employ the marginal log-likelihood, established using a small number of feature-label realizations (sample points) regularized with the prior pathway information about the variables. In the special case of a Normal-Wishart prior distribution on the mean and inverse covariance matrix (precision matrix) of a Gaussian distribution, these optimization problems become convex. Companion website: gsp.tamu.edu/Publications/supplementary/shahrokh13a.
引用
收藏
页码:202 / 218
页数:17
相关论文
共 41 条
  • [31] Integrating Bayesian networks and ontology to improve safety knowledge management in construction behavior: A conceptual framework
    Wang, Junwu
    Liu, Yipeng
    Feng, Jingtao
    AIN SHAMS ENGINEERING JOURNAL, 2024, 15 (09)
  • [32] LOGIC FORMULAS BASED KNOWLEDGE DISCOVERY AND ITS APPLICATION TO THE CLASSIFICATION OF BIOLOGICAL DATA
    Felici, G.
    Bertolazzi, P.
    Guarracino, M. R.
    Chinchuluun, A.
    Pardalos, P. M.
    BIOMAT 2008, 2009, : 265 - +
  • [33] Scalable Optimal Bayesian Classification of Single-Cell Trajectories under Regulatory Model Uncertainty
    Hajiramezanali, Ehsan
    Imani, Mahdi
    Braga-Neto, Ulisses
    Qian, Xiaoning
    Dougherty, Edward R.
    ACM-BCB'18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2018, : 596 - 597
  • [34] Scalable optimal Bayesian classification of single-cell trajectories under regulatory model uncertainty
    Hajiramezanali, Ehsan
    Imani, Mahdi
    Braga-Neto, Ulisses
    Qian, Xiaoning
    Dougherty, Edward R.
    BMC GENOMICS, 2019, 20 (Suppl 6)
  • [35] Optimal "Anti-Bayesian" Parametric Pattern Classification for the Exponential Family Using Order Statistics Criteria
    Thomas, A.
    Oommen, B. John
    IMAGE ANALYSIS AND RECOGNITION, PT I, 2012, 7324 : 11 - 18
  • [36] Ontology-Based Semantic Modeling of Knowledge in Construction: Classification and Identification of Hazards Implied in Images
    Zhong, Botao
    Li, Heng
    Luo, Hanbin
    Zhou, Jingyang
    Fang, Weili
    Xing, Xuejiao
    JOURNAL OF CONSTRUCTION ENGINEERING AND MANAGEMENT, 2020, 146 (04)
  • [37] Biological Knowledge Integration in DNA Microarray Gene Expression Classification Based on Rough Set Theory
    Calvo-Dmgz, D.
    Galvez, J. F.
    Glez-Pena, Daniel
    Fdez-Riverola, Florentino
    6TH INTERNATIONAL CONFERENCE ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS, 2012, 154 : 53 - 61
  • [38] Inference of Large-Scale Gene Regulatory Networks Using GA-based Bayesian Network and Biological Knowledge
    Tavakolkhah, Pegah
    Rahmati, Mohammad
    2009 3RD INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING, VOLS 1-11, 2009, : 598 - 601
  • [39] Integration of biological, economic, and sociological knowledge by Bayesian belief networks: the interdisciplinary evaluation of potential management plans for Baltic salmon
    Levontin, Polina
    Kulmala, Soile
    Haapasaari, Paivi
    Kuikka, Sakari
    ICES JOURNAL OF MARINE SCIENCE, 2011, 68 (03) : 632 - 638
  • [40] Ontology-based semantic modelling to support knowledge-based document classification on disaster-resilient construction practices
    Dhakal, Sunil
    Zhang, Lu
    Lv, Xuan
    INTERNATIONAL JOURNAL OF CONSTRUCTION MANAGEMENT, 2022, 22 (11) : 2059 - 2078