Re-evaluating the role of the Mahalanobis distance measure

被引:33
作者
Brereton, Richard G. [1 ]
Lloyd, Gavin R. [2 ]
机构
[1] Univ Bristol, Sch Chem, Cantocks Close, Bristol BS8 1TS, Avon, England
[2] Gloucestershire Hosp NHS Fdn Trust, Biophoton Res Unit, Great Western Rd, Gloucester GL1 3NN, England
关键词
Mahalanobis distance; pooled variance-covariance matrix; linear discriminant analysis; soft models; one-class classifiers; discrimination; principal component analysis; F DISTRIBUTION; MULTIVARIATE;
D O I
10.1002/cem.2779
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
It is shown that the sum of squares of the standardised scores of all non-zero principal components (PCs) equals the squared Mahalanobis distance. A new distance measure, the reduced Mahalanobis distance, is explored in which the number of PCs retained is less than the full rank model. It is illustrated by both one-class and two-class classifiers. Linear discriminant analysis can be employed as a soft model, and principal component analysis using the pooled variance-covariance matrix is introduced as an intermediate view between conjoint and disjoint models allowing linear discriminant analysis to be used on these reduced rank models. By choosing the most discriminatory PCs, it can be shown that the reduced Mahalanobis distance has superior performance over the full rank model for discriminating via soft models. Copyright (c) 2016 John Wiley & Sons, Ltd. The link between principal component analysis (PCA) and the Mahalanobis distance is discussed and a new reduced Mahalanobis distance measure is explored using both one- and two- class classifiers.
引用
收藏
页码:134 / 143
页数:10
相关论文
共 27 条
[1]  
[Anonymous], 1908, BIOMETRIKA, V6, P1
[2]  
[Anonymous], 2009, CHEMOMETRICS PATTERN
[3]   Hotelling's T squared distribution, its relationship to the F distribution and its use in multivariate space [J].
Brereton, Richard G. .
JOURNAL OF CHEMOMETRICS, 2016, 30 (01) :18-21
[4]   The F distribution and its relationship to the chi squared and t distributions [J].
Brereton, Richard G. .
JOURNAL OF CHEMOMETRICS, 2015, 29 (11) :582-586
[5]   The t-distribution and its relationship to the normal distribution [J].
Brereton, Richard G. .
JOURNAL OF CHEMOMETRICS, 2015, 29 (09) :481-483
[6]   The Mahalanobis distance and its relationship to principal component scores [J].
Brereton, Richard G. .
JOURNAL OF CHEMOMETRICS, 2015, 29 (03) :143-145
[7]   One-class classifiers [J].
Brereton, Richard G. .
JOURNAL OF CHEMOMETRICS, 2011, 25 (05) :225-246
[8]   USE OF A MICROCOMPUTER FOR THE DEFINITION OF MULTIVARIATE CONFIDENCE-REGIONS IN MEDICAL DIAGNOSIS BASED ON CLINICAL LABORATORY PROFILES [J].
COOMANS, D ;
BROECKAERT, I ;
DERDE, MP ;
TASSIN, A ;
MASSART, DL ;
WOLD, S .
COMPUTERS AND BIOMEDICAL RESEARCH, 1984, 17 (01) :1-14
[9]   The Mahalanobis distance [J].
De Maesschalck, R ;
Jouan-Rimbaud, D ;
Massart, DL .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2000, 50 (01) :1-18
[10]   Comparison of performance of five common classifiers represented as boundary methods: Euclidean Distance to Centroids, Linear Discriminant Analysis, Quadratic Discriminant Analysis, Learning Vector Quantization and Support Vector Machines, as dependent on data structure [J].
Dixon, Sarah J. ;
Brereton, Richard G. .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2009, 95 (01) :1-17