Model-population analysis and its applications in chemical and biological modeling

被引:96
作者
Li, Hong-Dong [1 ]
Liang, Yi-Zeng [1 ]
Xu, Qing-Song [2 ]
Cao, Dong-Sheng [1 ]
机构
[1] Cent South Univ, Coll Chem & Chem Engn, Changsha 410083, Peoples R China
[2] Cent South Univ, Sch Math Sci, Changsha 410083, Peoples R China
关键词
Algorithm; Bioinformatics; Chemometrics; Complex analytical system; Data modeling; Modeling; Model-population analysis (MPA); Monte Carlo sampling; Outlier detection; Variable selection; MULTIPLE OUTLIER DETECTION; VARIABLE SELECTION; GENE SELECTION; MULTIVARIATE CALIBRATION; CROSS-VALIDATION; CLASSIFICATION; ELIMINATION; REGRESSION; PREDICTION;
D O I
10.1016/j.trac.2011.11.007
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Model-population analysis (MPA) was recently proposed as a general framework for designing new types of chemometrics and bioinformatics algorithms, and it has found promising applications in chemistry and biology. The goal of MPA is to extract useful information from complex analytical systems, so as to lead to better understanding and better modeling of chemical and biological data. To give an overall picture of MPA, we first review its key elements. Then, we describe the theories and the applications of selected methods that focus on the two fundamental aspects in chemical and biological modeling: outlier detection and variable selection. We highlight the key common principles of these methods and pinpoint the critical differences underlying each method. (c) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:154 / 162
页数:9
相关论文
共 44 条
  • [1] Toward a gold standard for promoter prediction evaluation
    Abeel, Thomas
    Van de Peer, Yves
    Saeys, Yvan
    [J]. BIOINFORMATICS, 2009, 25 (12) : I313 - I320
  • [2] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [3] ASSESSING THE QUALITY OF SOLUTION NUCLEAR-MAGNETIC-RESONANCE STRUCTURES BY COMPLETE CROSS-VALIDATION
    BRUNGER, AT
    CLORE, GM
    GRONENBORN, AM
    SAFFRICH, R
    NILGES, M
    [J]. SCIENCE, 1993, 261 (5119) : 328 - 331
  • [4] A variable selection method based on uninformative variable elimination for multivariate calibration of near-infrared spectra
    Cai, Wensheng
    Li, Yankun
    Shao, Xueguang
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2008, 90 (02) : 188 - 194
  • [5] Candes E, 2007, ANN STAT, V35, P2313, DOI 10.1214/009053606000001523
  • [6] A New Strategy of Outlier Detection for QSAR/QSPR
    Cao, Dong-Sheng
    Liang, Yi-Zeng
    Xu, Qing-Song
    Li, Hong-Dong
    Chen, Xian
    [J]. JOURNAL OF COMPUTATIONAL CHEMISTRY, 2010, 31 (03) : 592 - 602
  • [7] Gene selection in cancer classification using sparse logistic regression with Bayesian regularization
    Cawley, Gavin C.
    Talbot, Nicola L. C.
    [J]. BIOINFORMATICS, 2006, 22 (19) : 2348 - 2355
  • [8] Elimination of uninformative variables for multivariate calibration
    Centner, V
    Massart, DL
    deNoord, OE
    deJong, S
    Vandeginste, BM
    Sterna, C
    [J]. ANALYTICAL CHEMISTRY, 1996, 68 (21) : 3851 - 3858
  • [9] Bayesian linear regression and variable selection for spectroscopic calibration
    Chen, Tao
    Martin, Elaine
    [J]. ANALYTICA CHIMICA ACTA, 2009, 631 (01) : 13 - 21
  • [10] RETRACTED: Double cross-validation and improved sensitivity of the rapid screen of mild traumatic brain injury (Retracted Article. See vol 29, pg 904, 2007)
    De Monte, VE
    Geffen, GM
    May, CR
    McFarland, K
    [J]. JOURNAL OF CLINICAL AND EXPERIMENTAL NEUROPSYCHOLOGY, 2004, 26 (05) : 628 - 644