A comparison of missing-data procedures for ARIMA time-series analysis

被引:58
|
作者
Velicer, WF
Colby, SM
机构
[1] Univ Rhode Isl, Canc Prevent Res Ctr, Kingston, RI 02881 USA
[2] Brown Univ, Providence, RI 02912 USA
关键词
missing data; ARIMA models; time-series analysis; autocorrelation;
D O I
10.1177/0013164404272502
中图分类号
G44 [教育心理学];
学科分类号
0402 ; 040202 ;
摘要
Missing data are a common practical problem for longitudinal designs. Time-series analysis is a longitudinal method that involves a large number of observations on a single unit. Four different missing-data methods (deletion, mean substitution, mean of adjacent observations, and maximum likelihood estimation) were evaluated. Computer-generated time-series data of length 100 were generated for 50 different conditions representing five levels ofautocorrelation, two levels of slope, and five levels of proportion of missing data. Methods were compared with respect to the accuracy of estimation for four parameters (level, error variance, degree of autocorrelation, and slope). The choice of method had a major impact on the analysis. The maximum likelihood very accurately estimated all four parameters under all conditions tested. The mean of the series was the least accurate approach. Statistical methods such as the maximum likelihood procedure represent a superior approach to missing data.
引用
收藏
页码:596 / 615
页数:20
相关论文
共 50 条
  • [41] Comparison of emergency department and hospital admissions data for air pollution time-series studies
    Winquist, A.
    Klein, M.
    Tolbert, P.
    Flanders, W. D.
    Hess, J.
    Sarnat, S. E.
    ENVIRONMENTAL HEALTH, 2012, 11
  • [42] Time-Aware Missing Healthcare Data Prediction Based on ARIMA Model
    Kong, Lingzhen
    Li, Guangshun
    Rafique, Wajid
    Shen, Shigen
    He, Qiang
    Khosravi, Mohammad R.
    Wang, Ruili
    Qi, Lianyong
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2024, 21 (04) : 1042 - 1050
  • [43] Multiple imputation for multivariate data with missing and below-threshold measurements: Time-series concentrations of pollutants in the Arctic
    Hopke, PK
    Liu, CH
    Rubin, DB
    BIOMETRICS, 2001, 57 (01) : 22 - 33
  • [44] TIME-SERIES ANALYSIS MODELS OF ACTIVATED-SLUDGE PLANTS
    NOVOTNY, V
    JONES, H
    FENG, X
    CAPODAGLIO, A
    WATER SCIENCE AND TECHNOLOGY, 1991, 23 (4-6) : 1107 - 1116
  • [45] Trend analysis of time-series data: A novel method for untargeted metabolite discovery
    Peters, Sonja
    Janssen, Hans-Gerd
    Vivo-Truyols, Gabriel
    ANALYTICA CHIMICA ACTA, 2010, 663 (01) : 98 - 104
  • [46] Integrating Machine Learning and Stochastic Pattern Analysis for the Forecasting of Time-Series Data
    Khan A.B.F.
    Kamalakannan K.
    Ahmed N.S.S.
    SN Computer Science, 4 (5)
  • [47] CONTRIBUTION OF TIME-SERIES ANALYSIS TO DATA-PROCESSING OF ASTRONOMICAL OBSERVATIONS IN CHINA
    ZHENG, D
    LUO, S
    STATISTICA SINICA, 1992, 2 (02) : 605 - 618
  • [48] Deep Learning for Anomaly Detection in Time-Series Data: Review, Analysis, and Guidelines
    Choi, Kukjin
    Yi, Jihun
    Park, Changhwa
    Yoon, Sungroh
    IEEE ACCESS, 2021, 9 : 120043 - 120065
  • [49] The "Caterpillar"-SSA method for analysis of time series with missing values
    Golyandina, N.
    Osipov, E.
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2007, 137 (08) : 2642 - 2653
  • [50] REGRESSION-ANALYSIS OF GROUPED SURVIVAL-DATA WITH INCOMPLETE COVARIATES - NONIGNORABLE MISSING-DATA AND CENSORING MECHANISMS
    BAKER, SG
    BIOMETRICS, 1994, 50 (03) : 821 - 826