Bayesian Error Analysis for Feature Selection in Biomarker Discovery

Cited by: 2
Authors
Pour, Ali Foroughi [1]
Dalton, Lori A. [1]
Affiliations
[1] Ohio State Univ, Dept Elect & Comp Engn, Columbus, OH 43210 USA
Funding
National Science Foundation (USA);
Keywords
Biomarker discovery; feature selection; error analysis; validation; Bayesian methods; bioinformatics; VARIABLE-SELECTION; MODEL ASSESSMENT; BREAST-CANCER; VALIDATION; EXPRESSION; PROPORTION;
DOI
10.1109/ACCESS.2019.2932622
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
We present a novel Bayesian validation paradigm with several validation metrics tailored to biomarker discovery, including moments (the mean and variance) of the number of false discoveries, the number of missed discoveries, and the false discovery rate. All of these validation metrics can be used with a variety of Bayesian variable selection methods already available in the literature. When used in conjunction with Bayesian models with independent Gaussian features, we call these validation metrics optimal Bayesian feature filtering moments (OBFMs). We find closed-form expressions for OBFMs and show that they are asymptotically Gaussian and consistent even when the modeling assumptions are violated. In both synthetic simulations and real data analysis, OBFMs perform very well in biomarker discovery relative to other methods from the literature.
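To make the validation metrics named in the abstract concrete, here is a minimal sketch of how the mean and variance of the number of false and missed discoveries could be computed from per-feature posterior probabilities. It is not the paper's closed-form OBFM derivation. It assumes that each feature has a posterior probability of being a true biomarker, that feature identities are independent under the posterior, and that the selected set is fixed given the data. The function names and the `post_prob` and `selected` parameters are hypothetical.

```python
from typing import Sequence, Tuple


def false_discovery_moments(
    post_prob: Sequence[float], selected: Sequence[int]
) -> Tuple[float, float]:
    """Mean and variance of the number of false discoveries.

    Assumption: post_prob[i] is the posterior probability that feature i
    is a true biomarker, and features are independent under the posterior,
    so the count is a sum of independent Bernoulli variables.
    """
    # A selected feature is a false discovery with probability 1 - post_prob[i].
    q = [1.0 - post_prob[i] for i in selected]
    mean = sum(q)
    var = sum(qi * (1.0 - qi) for qi in q)
    return mean, var


def missed_discovery_moments(
    post_prob: Sequence[float], selected: Sequence[int]
) -> Tuple[float, float]:
    """Mean and variance of the number of missed discoveries."""
    chosen = set(selected)
    # An unselected feature is a missed discovery with probability post_prob[i].
    p = [pi for i, pi in enumerate(post_prob) if i not in chosen]
    mean = sum(p)
    var = sum(pi * (1.0 - pi) for pi in p)
    return mean, var


def expected_fdr(post_prob: Sequence[float], selected: Sequence[int]) -> float:
    """Expected false discovery rate for a fixed, non-random selected set."""
    if not selected:
        return 0.0  # convention: no selections means no false discoveries
    mean, _ = false_discovery_moments(post_prob, selected)
    return mean / len(selected)


if __name__ == "__main__":
    # Illustrative (made-up) posterior probabilities for four features.
    post_prob = [0.9, 0.8, 0.1, 0.5]
    selected = [0, 1]
    print(false_discovery_moments(post_prob, selected))
    print(missed_discovery_moments(post_prob, selected))
    print(expected_fdr(post_prob, selected))
```

Because the selected set is treated as fixed, the expected false discovery rate reduces to the expected number of false discoveries divided by the number of selected features. A selection rule that depends on further randomness would need a different treatment.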
Pages: 127544-127563
Page count: 20