Optimal classifiers with minimum expected error within a Bayesian framework-Part I: Discrete and Gaussian models

被引:64
作者
Dalton, Lori A. [1 ]
Dougherty, Edward R. [2 ,3 ]
机构
[1] Ohio State Univ, Dept Elect & Comp Engn, Dreese Lab 205, Columbus, OH 43210 USA
[2] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX USA
[3] Translat Genom Res Inst, Computat Biol Div, Phoenix, AZ USA
基金
美国国家科学基金会;
关键词
Bayesian estimation; Classification; Error estimation; Genomics; Minimum mean-square estimation; Small samples; OPTIMAL NUMBER; CLASSIFICATION ERROR; FEATURES; ESTIMATORS;
D O I
10.1016/j.patcog.2012.10.018
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, biomedicine has faced a flood of difficult small-sample phenotype discrimination problems. A host of classification rules have been proposed to discriminate types of pathology, stages of disease and other diagnoses. Typically, these classification rules are heuristic algorithms, with very little understood about their performance. To give a concrete mathematical structure to the problem, recent work has utilized a Bayesian modeling framework based on an uncertainty class of feature-label distributions to both optimize and analyze error estimator performance. The current study uses the same Bayesian framework to also optimize classifier design. This completes a Bayesian theory of classification, where both the classifier error and the estimate of the error may be optimized and studied probabilistically within the model framework. This paper, the first of a two-part study, derives optimal classifiers in discrete and Gaussian models, demonstrates their superior performance over popular classifiers within the assumed model, and applies the method to real genomic data. The second part of the study discusses properties of these optimal Bayesian classifiers. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1301 / 1314
页数:14
相关论文
共 26 条
[1]  
[Anonymous], 1961, Applied Statistical Decision Theory
[2]  
[Anonymous], EURASIP J BIOINFORMA
[3]  
[Anonymous], 1925, STAT METHODS RES WOR
[4]  
[Anonymous], 2000, Pattern Classification
[5]   Is cross-validation valid for small-sample microarray classification? [J].
Braga-Neto, UM ;
Dougherty, ER .
BIOINFORMATICS, 2004, 20 (03) :374-380
[6]   Exact Sample Conditioned MSE Performance of the Bayesian MMSE Estimator for Classification Error-Part II: Consistency and Performance Analysis [J].
Dalton, Lori A. ;
Dougherty, Edward R. .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2012, 60 (05) :2588-2603
[7]   Exact Sample Conditioned MSE Performance of the Bayesian MMSE Estimator for Classification Error-Part I: Representation [J].
Dalton, Lori A. ;
Dougherty, Edward R. .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2012, 60 (05) :2575-2587
[8]   Optimal mean-square-error calibration of classifier error estimators under Bayesian models [J].
Dalton, Lori A. ;
Dougherty, Edward R. .
PATTERN RECOGNITION, 2012, 45 (06) :2308-2320
[9]   Application of the Bayesian MMSE estimator for classification error to gene expression microarray data [J].
Dalton, Lori A. ;
Dougherty, Edward R. .
BIOINFORMATICS, 2011, 27 (13) :1822-1831
[10]   Bayesian Minimum Mean-Square Error Estimation for Classification Error-Part I: Definition and the Bayesian MMSE Error Estimator for Discrete Classification [J].
Dalton, Lori A. ;
Dougherty, Edward R. .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2011, 59 (01) :115-129