A note on Mahalanobis and related distance measures in WinISI and The Unscrambler

被引:21
作者
Garrido-Varo, A. [1 ]
Garcia-Olmo, J. [2 ]
Fearn, T. [3 ]
机构
[1] Univ Cordoba, Nondestruct Spectral Sensors Unit, Fac Agr & Forestry Engn, Cordoba, Spain
[2] Univ Cordoba, NIR MIR Spect Unit, Cent Serv Res Support, Cordoba, Spain
[3] UCL, Dept Stat Sci, Gower St, London WC1E 6BT, England
关键词
Mahalanobis distance; leverage; Hotelling's T-2; principal components; near infrared spectroscopy; outliers;
D O I
10.1177/0967033519848296
中图分类号
O69 [应用化学];
学科分类号
081704 ;
摘要
In identifying spectral outliers in near infrared calibration it is common to use a distance measure that is related to Mahalanobis distance. However, different software packages tend to use different variants, which lead to a translation problem if more than one package is used. Here the relationships between squared Mahalanobis distance D-2, the GH distance of WinISI, and the T-2 and leverage (L) statistics of The Unscrambler are established as D-2 = T-2 approximate to L x n approximate to GH x k, where n and k are the numbers of samples and variables, respectively, in the set of spectral data used to establish the distance measure. The implications for setting thresholds for outlier detection are discussed. On the way to this result the principal component scores from WinISI and The Unscrambler are compared. Both packages scale the scores for a component to have variances proportional to the contribution of that component to total variance, but the WinISI scores, unlike those from The Unscrambler, do not have mean zero.
引用
收藏
页码:253 / 258
页数:6
相关论文
共 6 条
  • [1] [Anonymous], 2005, Applied Linear Regression
  • [2] [Anonymous], 2000, Principles of multivariate analysis
  • [3] Ghosh JK, 1998, ENCY BIOSTATISTICS, P2372
  • [4] MAHALANOBIS PRASANTA CHANDRA, 1930, JOUR AND PROC ASIATIC SOC BENGAL, V26, P541
  • [5] Martens H., 2001, MULTIVARIATE CALIBRA
  • [6] Naes T., 2002, NIR PUBLICATIONS