Incorporation of Biological Pathway Knowledge in the Construction of Priors for Optimal Bayesian Classification

被引:32
|
作者
Esfahani, Mohammad Shahrokh [1 ,2 ]
Dougherty, Edward R. [1 ,2 ]
机构
[1] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77843 USA
[2] Texas A&M Univ, Ctr Bioinformat & Genom Syst Engn, College Stn, TX USA
关键词
Phenotype classification; biological pathway knowledge; optimal Bayesian classifier (OBC); prior probability construction; regularization; convex optimization; synthetic pathway generation; SQUARE ERROR ESTIMATION; MINIMUM EXPECTED ERROR; OPTIMAL CLASSIFIERS; INFORMATION; CANCER; UNCERTAINTY; FRAMEWORK; MODEL; DISTRIBUTIONS; DISCRETE;
D O I
10.1109/TCBB.2013.143
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Small samples are commonplace in genomic/proteomic classification, the result being inadequate classifier design and poor error estimation. The problem has recently been addressed by utilizing prior knowledge in the form of a prior distribution on an uncertainty class of feature-label distributions. A critical issue remains: how to incorporate biological knowledge into the prior distribution. For genomics/proteomics, the most common kind of knowledge is in the form of signaling pathways. Thus, it behooves us to find methods of transforming pathway knowledge into knowledge of the feature-label distribution governing the classification problem. In this paper, we address the problem of prior probability construction by proposing a series of optimization paradigms that utilize the incomplete prior information contained in pathways (both topological and regulatory). The optimization paradigms employ the marginal log-likelihood, established using a small number of feature-label realizations (sample points) regularized with the prior pathway information about the variables. In the special case of a Normal-Wishart prior distribution on the mean and inverse covariance matrix (precision matrix) of a Gaussian distribution, these optimization problems become convex. Companion website: gsp.tamu.edu/Publications/supplementary/shahrokh13a.
引用
收藏
页码:202 / 218
页数:17
相关论文
共 41 条