Incorporation of Biological Pathway Knowledge in the Construction of Priors for Optimal Bayesian Classification

Cited by: 32
Authors
Esfahani, Mohammad Shahrokh [1, 2]
Dougherty, Edward R. [1, 2]
Affiliations
[1] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77843 USA
[2] Texas A&M Univ, Ctr Bioinformat & Genom Syst Engn, College Stn, TX USA
Keywords
Phenotype classification; biological pathway knowledge; optimal Bayesian classifier (OBC); prior probability construction; regularization; convex optimization; synthetic pathway generation; square error estimation; minimum expected error; optimal classifiers; information; cancer; uncertainty; framework; model; distributions; discrete
DOI
10.1109/TCBB.2013.143
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Small samples are commonplace in genomic/proteomic classification, the result being inadequate classifier design and poor error estimation. The problem has recently been addressed by utilizing prior knowledge in the form of a prior distribution on an uncertainty class of feature-label distributions. A critical issue remains: how to incorporate biological knowledge into the prior distribution. For genomics/proteomics, the most common kind of knowledge is in the form of signaling pathways. Thus, it behooves us to find methods of transforming pathway knowledge into knowledge of the feature-label distribution governing the classification problem. In this paper, we address the problem of prior probability construction by proposing a series of optimization paradigms that utilize the incomplete prior information contained in pathways (both topological and regulatory). The optimization paradigms employ the marginal log-likelihood, established using a small number of feature-label realizations (sample points) regularized with the prior pathway information about the variables. In the special case of a Normal-Wishart prior distribution on the mean and inverse covariance matrix (precision matrix) of a Gaussian distribution, these optimization problems become convex. Companion website: gsp.tamu.edu/Publications/supplementary/shahrokh13a.
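As a rough illustration of the quantity the abstract refers to, the sketch below computes the marginal log-likelihood of a small Gaussian sample under a conjugate Normal-Wishart prior on the mean and precision matrix. It is not the authors' implementation: the function names, the hyperparameter names (`m0`, `kappa0`, `nu0`, `psi0`), and the parametrization by the inverse Wishart scale matrix `psi0` are assumptions made for this example, and the pathway-based regularization term described in the abstract is not shown. Plain Python is used only to keep the example dependency-free.

```python
import math


def log_multigamma(a, d):
    """Log of the multivariate gamma function Gamma_d(a)."""
    return (d * (d - 1) / 4.0) * math.log(math.pi) + sum(
        math.lgamma(a - j / 2.0) for j in range(d)
    )


def logdet_spd(m):
    """Log-determinant of a symmetric positive-definite matrix via Cholesky."""
    d = len(m)
    low = [[0.0] * d for _ in range(d)]
    for i in range(d):
        for j in range(i + 1):
            s = m[i][j] - sum(low[i][k] * low[j][k] for k in range(j))
            low[i][j] = math.sqrt(s) if i == j else s / low[j][j]
    return 2.0 * sum(math.log(low[i][i]) for i in range(d))


def posterior(xs, m0, kappa0, nu0, psi0):
    """Conjugate update of the (assumed) hyperparameters given sample xs.

    psi0 is the inverse of the Wishart scale matrix on the precision
    (an assumption of this sketch); nu0 must exceed d - 1.
    """
    n, d = len(xs), len(m0)
    xbar = [sum(x[j] for x in xs) / n for j in range(d)]
    kappa_n, nu_n = kappa0 + n, nu0 + n
    m_n = [(kappa0 * m0[j] + n * xbar[j]) / kappa_n for j in range(d)]
    shrink = kappa0 * n / kappa_n
    psi_n = [
        [
            psi0[i][j]
            + sum((x[i] - xbar[i]) * (x[j] - xbar[j]) for x in xs)
            + shrink * (xbar[i] - m0[i]) * (xbar[j] - m0[j])
            for j in range(d)
        ]
        for i in range(d)
    ]
    return m_n, kappa_n, nu_n, psi_n


def log_marginal(xs, m0, kappa0, nu0, psi0):
    """Marginal log-likelihood of xs with mean and precision integrated out."""
    n, d = len(xs), len(m0)
    _, kappa_n, nu_n, psi_n = posterior(xs, m0, kappa0, nu0, psi0)
    return (
        -0.5 * n * d * math.log(math.pi)
        + log_multigamma(nu_n / 2.0, d)
        - log_multigamma(nu0 / 2.0, d)
        + 0.5 * nu0 * logdet_spd(psi0)
        - 0.5 * nu_n * logdet_spd(psi_n)
        + 0.5 * d * (math.log(kappa0) - math.log(kappa_n))
    )


# Illustrative values only: three 2-D sample points and a weak prior.
sample = [[0.2, -0.1], [1.0, 0.5], [-0.3, 0.8]]
score = log_marginal(sample, [0.0, 0.0], 1.0, 4.0, [[1.0, 0.2], [0.2, 1.0]])
```

In a prior-construction setting of the kind the abstract describes, a score like this would be evaluated as a function of the hyperparameters and combined with a regularizer encoding the pathway knowledge; the form of that regularizer is specific to the paper and is not reproduced here.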
Pages: 202-218
Number of pages: 17
Related papers
41 in total
  • [1] Incorporating biological prior knowledge for Bayesian learning via maximal knowledge-driven information priors
    Boluki, Shahin
    Esfahani, Mohammad Shahrokh
    Qian, Xiaoning
    Dougherty, Edward R.
    BMC BIOINFORMATICS, 2017, 18
  • [2] Constructing Pathway-Based Priors within a Gaussian Mixture Model for Bayesian Regression and Classification
    Boluki, Shahin
    Esfahani, Mohammad Shahrokh
    Qian, Xiaoning
    Dougherty, Edward R.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (02) : 524 - 537
  • [3] Sample-Based Prior Probability Construction Using Biological Pathway Knowledge
    Esfahani, Mohammad Shahrokh
    Dougherty, Edward R.
    2013 ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2013 : 1405 - 1409
  • [4] Classifier design given an uncertainty class of feature distributions via regularized maximum likelihood and the incorporation of biological pathway knowledge in steady-state phenotype classification
    Esfahani, Mohammad Shahrokh
    Knight, Jason
    Zollanvari, Amin
    Yoon, Byung-Jun
    Dougherty, Edward R.
    PATTERN RECOGNITION, 2013, 46 (10) : 2783 - 2797
  • [5] Optimal Bayesian Classification With Missing Values
    Dadaneh, Siamak Zamani
    Dougherty, Edward R.
    Qian, Xiaoning
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2018, 66 (16) : 4182 - 4192
  • [6] Predictive construction of priors in Bayesian nonparametrics
    Fortini, Sandra
    Petrone, Sonia
    BRAZILIAN JOURNAL OF PROBABILITY AND STATISTICS, 2012, 26 (04) : 423 - 449
  • [7] Detecting Multivariate Gene Interactions in RNA-Seq Data Using Optimal Bayesian Classification
    Knight, Jason M.
    Ivanov, Ivan
    Triff, Karen
    Chapkin, Robert S.
    Dougherty, Edward R.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (02) : 484 - 493
  • [8] An Optimization-Based Framework for the Transformation of Incomplete Biological Knowledge into a Probabilistic Structure and Its Application to the Utilization of Gene/Protein Signaling Pathways in Discrete Phenotype Classification
    Esfahani, Mohammad Shahrokh
    Dougherty, Edward R.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (06) : 1304 - 1321
  • [9] Sequential Sampling for Optimal Bayesian Classification of Sequencing Count Data
    Broumand, Ariana
    Dadaneh, Siamak Zamani
    2018 CONFERENCE RECORD OF 52ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2018 : 1357 - 1361
  • [10] Optimal Geometry Analysis for Target Localization With Bayesian Priors
    Nguyen, Ngoc Hung
    IEEE ACCESS, 2021, 9 : 33419 - 33437