Correlation Analysis in Contaminated Data by Singular Spectrum Analysis

Cited by: 22
Authors
Rodrigues, Paulo Canas [1 ,2 ]
Mahmoudvand, Rahim [1 ,3 ]
Affiliations
[1] Univ Fed Bahia, Dept Stat, Salvador, BA, Brazil
[2] Univ Tampere, CAST, Tampere, Finland
[3] Bu Ali Sina Univ, Dept Stat, Hamadan, Iran
Keywords
correlation analysis; singular spectrum analysis; robust correlation; robust hybrid filtering methods
DOI
10.1002/qre.2027
Chinese Library Classification (CLC) number
T [Industrial Technology]
Subject classification code
08
Abstract
Correlation analysis is one of the standard and most informative descriptive statistical tools when studying relationships between variables in bivariate and multivariate data. However, when data is contaminated with outlying observations, the standard Pearson correlation might be misleading and result in erroneous outcomes. In this paper, we propose three new approaches to find linear correlation based on the nonparametric method designed to analyse time series data, the singular spectrum analysis. In these proposals, the correlation is obtained after removing the noise from the data by using singular spectrum analysis based methods. The usefulness of our proposals in contaminated data is assessed by Monte Carlo simulation with different schemes of contamination, and with applications to real data on aluminium industry and synthetic sparse data. In addition, the model comparisons are made with robust hybrid filtering methods. Copyright © 2016 John Wiley & Sons, Ltd.
Pages: 2127-2137
Page count: 11
Related papers
21 in total
[1]  
ABDULLAH MB, 1990, J ROY STAT SOC D-STA, V39, P455
[2]   Application of singular spectrum analysis to the smoothing of raw kinematic signals [J].
Alonso, FJ ;
Del Castillo, JM ;
Pintado, P .
JOURNAL OF BIOMECHANICS, 2005, 38 (05) :1085-1092
[3]  
BARREIRO JM, 2004, BIOL MED DATA ANAL
[4]   A little-known robust estimator of the correlation coefficient and its use in a robust graphical test for bivariate normality with applications in the aluminium industry [J].
Evandt, O ;
Coleman, S ;
Ramalhoto, MF ;
van Lottum, C .
QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2004, 20 (05) :433-456
[5]   Repeated median and hybrid filters [J].
Fried, R ;
Bernholt, T ;
Gather, U .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 50 (09) :2313-2338
[6]   A comparison of automatic filtering techniques applied to biomechanical walking data [J].
Giakas, G ;
Baltzopoulos, V .
JOURNAL OF BIOMECHANICS, 1997, 30 (08) :847-850
[7]  
Golyandina N., 2001, ANAL TIME SERIES STR
[8]  
Golyandina N, 2010, STAT INTERFACE, V3, P259
[9]  
HASSANI H, 2011, COMPTES RENDUS MATH, V351, P987
[10]   MULTIVARIATE SINGULAR SPECTRUM ANALYSIS: A GENERAL VIEW AND NEW VECTOR FORECASTING APPROACH [J].
Hassani, Hossein ;
Mahmoudvand, Rahim .
INTERNATIONAL JOURNAL OF ENERGY AND STATISTICS, 2013, 1 (01) :55-83