Incorporation of Biological Pathway Knowledge in the Construction of Priors for Optimal Bayesian Classification

被引:32
|
作者
Esfahani, Mohammad Shahrokh [1 ,2 ]
Dougherty, Edward R. [1 ,2 ]
机构
[1] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77843 USA
[2] Texas A&M Univ, Ctr Bioinformat & Genom Syst Engn, College Stn, TX USA
关键词
Phenotype classification; biological pathway knowledge; optimal Bayesian classifier (OBC); prior probability construction; regularization; convex optimization; synthetic pathway generation; SQUARE ERROR ESTIMATION; MINIMUM EXPECTED ERROR; OPTIMAL CLASSIFIERS; INFORMATION; CANCER; UNCERTAINTY; FRAMEWORK; MODEL; DISTRIBUTIONS; DISCRETE;
D O I
10.1109/TCBB.2013.143
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Small samples are commonplace in genomic/proteomic classification, the result being inadequate classifier design and poor error estimation. The problem has recently been addressed by utilizing prior knowledge in the form of a prior distribution on an uncertainty class of feature-label distributions. A critical issue remains: how to incorporate biological knowledge into the prior distribution. For genomics/proteomics, the most common kind of knowledge is in the form of signaling pathways. Thus, it behooves us to find methods of transforming pathway knowledge into knowledge of the feature-label distribution governing the classification problem. In this paper, we address the problem of prior probability construction by proposing a series of optimization paradigms that utilize the incomplete prior information contained in pathways (both topological and regulatory). The optimization paradigms employ the marginal log-likelihood, established using a small number of feature-label realizations (sample points) regularized with the prior pathway information about the variables. In the special case of a Normal-Wishart prior distribution on the mean and inverse covariance matrix (precision matrix) of a Gaussian distribution, these optimization problems become convex. Companion website: gsp.tamu.edu/Publications/supplementary/shahrokh13a.
引用
收藏
页码:202 / 218
页数:17
相关论文
共 41 条
  • [21] Incorporating biological knowledge for construction of fuzzy networks of gene associations
    Gomez-Vela, Francisco
    Barranco, Carlos D.
    Diaz-Diaz, Norberto
    APPLIED SOFT COMPUTING, 2016, 42 : 144 - 155
  • [22] Incorporation of heuristic knowledge in the optimal design of formulated products: Application to a cosmetic emulsion
    Arrieta-Escobar, Javier A.
    Bernardo, Fernando P.
    Orjuela, Alvaro
    Camargo, Mauricio
    Morel, Laure
    COMPUTERS & CHEMICAL ENGINEERING, 2019, 122 : 265 - 274
  • [23] Integration of Pathway Knowledge and Dynamic Bayesian Networks for the Prediction of Oral Cancer Recurrence
    Kourou, Konstantina
    Papaloukas, Costas
    Fotiadis, Dimitrios I.
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2017, 21 (02) : 320 - 327
  • [24] Sparse Bayesian classification and feature selection for biological expression data with high correlations
    Yang, Xian
    Pan, Wei
    Guo, Yike
    PLOS ONE, 2017, 12 (12):
  • [25] Inclusion of biological knowledge in a Bayesian shrinkage model for joint estimation of SNP effects
    Pereira, Miguel
    Thompson, John R.
    Weichenberger, Christian X.
    Thomas, Duncan C.
    Minelli, Cosetta
    GENETIC EPIDEMIOLOGY, 2017, 41 (04) : 320 - 331
  • [26] Distilling heterogeneous knowledge with aligned biological entities for histological image classification
    Wang, Kang
    Zheng, Feiyang
    Guan, Dayan
    Liu, Jia
    Qin, Jing
    PATTERN RECOGNITION, 2025, 160
  • [27] Quantitative utilization of prior biological knowledge in the Bayesian network modeling of gene expression data
    Gao, Shouguo
    Wang, Xujing
    BMC BIOINFORMATICS, 2011, 12
  • [28] Pathway analysis of high-throughput biological data within a Bayesian network framework
    Isci, Senol
    Ozturk, Cengizhan
    Jones, Jon
    Otu, Hasan H.
    BIOINFORMATICS, 2011, 27 (12) : 1667 - 1674
  • [29] Are we really Bayesian? Probabilistic inference shows sub-optimal knowledge transfer
    Lin, Chin-Hsuan Sophie
    Do, Trang Thuy
    Unsworth, Lee
    Garrido, Marta I.
    PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (01)
  • [30] MCMC implementation of the optimal Bayesian classifier for non-Gaussian models: model-based RNA-Seq classification
    Knight, Jason M.
    Ivanov, Ivan
    Dougherty, Edward R.
    BMC BIOINFORMATICS, 2014, 15