Multiple Outlier Detection: Hypothesis Tests versus Model Selection by Information Criteria

被引:45
作者
Lehmann, Ruediger [1 ]
Loesler, Michael [2 ]
机构
[1] Univ Appl Sci Dresden, Fac Spatial Informat, Friedrich List Pl 1, D-01069 Dresden, Germany
[2] Frankfurt Univ Appl Sci, Fac Architecture Civil Engn & Geomat, Lab Ind Metrol, Nibelungenpl 1, D-60318 Frankfurt, Germany
关键词
Least-squares adjustment; Outlier detection; Hypothesis test; Information criterion; Akaike information criterion (AIC); Data snooping; Model selection; REGRESSION;
D O I
10.1061/(ASCE)SU.1943-5428.0000189
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The detection of multiple outliers can be interpreted as a model selection problem. Models that can be selected are the null model, which indicates an outlier free set of observations, or a class of alternative models, which contain a set of additional bias parameters. A common way to select the right model is by using a statistical hypothesis test. In geodesy data snooping is most popular. Another approach arises from information theory. Here, the Akaike information criterion (AIC) is used to select an appropriate model for a given set of observations. The AIC is based on the Kullback-Leibler divergence, which describes the discrepancy between the model candidates. Both approaches are discussed and applied to test problems: the fitting of a straight line and a geodetic network. Some relationships between data snooping and information criteria are discussed. When compared, it turns out that the information criteria approach is more simple and elegant. Along with AIC there are many alternative information criteria for selecting different outliers, and it is not clear which one is optimal. (C) 2016 American Society of Civil Engineers.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] A New Outlier Detection Model Using Random Walk on Local Information Graph
    Wang, Chao
    Gao, Hui
    Liu, Zhen
    Fu, Yan
    IEEE ACCESS, 2018, 6 : 75531 - 75544
  • [32] Outlier Detection in Wireless Sensor Networks Using Model Selection-Based Support Vector Data Descriptions
    Huan, Zhan
    Wei, Chang
    Li, Guang-Hui
    SENSORS, 2018, 18 (12)
  • [33] Simultaneous Model Selection, Robust Data Reconciliation and Outlier Detection with Swarm Intelligence in a Thermal Reactor Power calculation
    Valdetaro, Eduardo Damianik
    Schirru, Roberto
    ANNALS OF NUCLEAR ENERGY, 2011, 38 (09) : 1820 - 1832
  • [34] PERFORMANCE OF INFORMATION CRITERIA FOR MODEL SELECTION IN A LATENT GROWTH CURVE MIXTURE MODEL
    Usami, Satoshi
    JOURNAL JAPANESE SOCIETY OF COMPUTATIONAL STATISTICS, 2014, 27 (01): : 17 - 48
  • [35] Comparison of Outlier Detection Methods in Linear Regression: A Multiple-Criteria Decision-Making Approach
    Satman, Mehmet Hakan
    ACTA INFOLOGICA, 2023, 7 (02): : 333 - 347
  • [36] Rapid outlier detection, model selection and variable selection using penalized likelihood estimation for general spatial models
    Song, Yunquan
    Fang, Minglu
    Wang, Yuanfeng
    Hou, Yiming
    SPATIAL STATISTICS, 2024, 61
  • [37] Indicators of hotel profitability: Model selection using Akaike information criteria
    Taylor, D. Christopher
    Snipes, Michael
    Barber, Nelson A.
    TOURISM AND HOSPITALITY RESEARCH, 2018, 18 (01) : 61 - 71
  • [38] Information and posterior probability criteria for model selection in local likelihood estimation
    Irizarry, RA
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (453) : 303 - 315
  • [39] CONSISTENCY OF A CLASS OF INFORMATION CRITERIA FOR MODEL SELECTION IN NONLINEAR-REGRESSION
    HAUGHTON, D
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1991, 20 (5-6) : 1619 - 1629
  • [40] Model selection for dynamical systems via sparse regression and information criteria
    Mangan, N. M.
    Kutz, J. N.
    Brunton, S. L.
    Proctor, J. L.
    PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2017, 473 (2204):