Model population analysis in model evaluation

被引:11
作者
Deng, Baichuan [1 ]
Lu, Hongmei [2 ]
Tan, Chengquan [1 ]
Deng, Jinping [1 ]
Yin, Yulong [1 ]
机构
[1] South China Agr Univ, Guangdong Prov Key Lab Anim Nutr Control, Inst Subtrop Anim Nutr & Feed, Coll Anim Sci, Guangzhou 510642, Guangdong, Peoples R China
[2] Cent S Univ, Coll Chem & Chem Engn, Changsha 410083, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Model population analysis; Model evaluation; Model comparison; Model selection; CARLO CROSS-VALIDATION; MULTIVARIATE CALIBRATION; VARIABLE SELECTION; BIOMARKER DISCOVERY; REGRESSION; CHEMOMETRICS; OPTIMIZES; STRATEGY;
D O I
10.1016/j.chemolab.2017.11.016
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Model evaluation plays a central role in chemical modeling. Model population analysis (MPA), a general framework for designing new types of chemometrics algorithms, has shown its advantage in the field of model evaluation. The core idea of MPA is to statistically analyze the outputs of randomly generated sub-models to extract interesting information from the data. One of the most obvious characteristics of MPA-based methods is that they use multiple models instead of a single model for model evaluation. In this review, we described the concept of MPA, and then discussed the application of MPA in model evaluation, including the relationship between MPA and cross-validation, model comparison, randomization tests, model stability, variable importance and sum of rank differences. Finally, we prospected the potential application of MPA in model evaluation.
引用
收藏
页码:223 / 228
页数:6
相关论文
共 42 条
  • [1] NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION
    AKAIKE, H
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) : 716 - 723
  • [2] OCCAM RAZOR
    BLUMER, A
    EHRENFEUCHT, A
    HAUSSLER, D
    WARMUTH, MK
    [J]. INFORMATION PROCESSING LETTERS, 1987, 24 (06) : 377 - 380
  • [3] SCIENCE AND STATISTICS
    BOX, GEP
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1976, 71 (356) : 791 - 799
  • [4] Statistical modeling: The two cultures
    Breiman, L
    [J]. STATISTICAL SCIENCE, 2001, 16 (03) : 199 - 215
  • [5] A New Strategy of Outlier Detection for QSAR/QSPR
    Cao, Dong-Sheng
    Liang, Yi-Zeng
    Xu, Qing-Song
    Li, Hong-Dong
    Chen, Xian
    [J]. JOURNAL OF COMPUTATIONAL CHEMISTRY, 2010, 31 (03) : 592 - 602
  • [6] A bootstrapping soft shrinkage approach for variable selection in chemical modeling
    Deng, Bai-Chuan
    Yun, Yong-Huan
    Cao, Dong-Sheng
    Yin, Yu-Long
    Wang, Wei-Ting
    Lu, Hong-Mei
    Luo, Qian-Yi
    Liang, Yi-Zeng
    [J]. ANALYTICA CHIMICA ACTA, 2016, 908 : 63 - 74
  • [7] Model population analysis in chemometrics
    Deng, Bai-Chuan
    Yun, Yong-Huan
    Liang, Yi-Zeng
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2015, 149 : 166 - 176
  • [8] A new strategy to prevent over-fitting in partial least squares models based on model population analysis
    Deng, Bai-Chuan
    Yun, Yong-Huan
    Liang, Yi-Zeng
    Cao, Dong-Sheng
    Xu, Qing-Song
    Yi, Lun-Zhao
    Huang, Xin
    [J]. ANALYTICA CHIMICA ACTA, 2015, 880 : 32 - 41
  • [9] A new method for wavelength interval selection that intelligently optimizes the locations, widths and combinations of the intervals
    Deng, Bai-Chuan
    Yun, Yong-Huan
    Ma, Pan
    Lin, Chen-Chen
    Ren, Da-Bing
    Liang, Yi-Zeng
    [J]. ANALYST, 2015, 140 (06) : 1876 - 1885
  • [10] A novel variable selection approach that iteratively optimizes variable space using weighted binary matrix sampling
    Deng, Bai-chuan
    Yun, Yong-huan
    Liang, Yi-zeng
    Yi, Lun-zhao
    [J]. ANALYST, 2014, 139 (19) : 4836 - 4845