Fast and simple methods for the optimization of kurtosis used as a projection pursuit index

被引:34
作者
Hou, S. [1 ]
Wentzell, P. D. [1 ]
机构
[1] Dalhousie Univ, Dept Chem, Halifax, NS B3H 4J3, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Optimization; Quasi-power method; Univariate kurtosis; Multivariate kurtosis; Projection pursuit; Independent component analysis; FIXED-POINT ALGORITHMS; MULTIVARIATE OUTLIER DETECTION; PATTERN-RECOGNITION;
D O I
10.1016/j.aca.2011.08.006
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
As a powerful method for exploratory data analysis, projection pursuit (PP) often outperforms principal component analysis (PCA) to discover important data structure. PP was proposed in 1970s but has not been widely used in chemistry largely because of the difficulty in the optimization of projection indices. In this work, new algorithms, referred as "quasi-power methods", are proposed to optimize kurtosis as a projection index. The new algorithms are simple, fast, and stable, which makes the search for the global optimum more efficient in the presence of multiple local optima. Maximization of kurtosis is helpful in the detection of outliers, while minimization tends to reveal clusters in the data, so the ability to search separately for the maximum and minimum of kurtosis is desirable. The proposed algorithms can search for either with only minor changes. Unlike other methods, no optimization of step size is required and sphering or whitening of the data is not necessary. Both univariate and multivariate kurtosis can be optimized by the proposed algorithms. The performance of the algorithms is evaluated using three simulated data sets and its utility is demonstrated with three experimental data sets relevant to analytical chemistry. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 48 条
[1]  
Afriat S. N., 1957, MATH P CAMBRIDGE PHI, V53, P800
[2]  
[Anonymous], 1969, Statistical computation, DOI [10.1016/B978-0-12-498150-8.50024-0, DOI 10.1016/B978-0-12-498150-8.50024-0]
[3]  
[Anonymous], LINEAR ALGEBRA MODER
[4]   A robustification of independent component analysis [J].
Brys, G ;
Hubert, M ;
Rousseeuw, PJ .
JOURNAL OF CHEMOMETRICS, 2005, 19 (5-7) :364-375
[5]   Fluorescence spectroscopy and PARAFAC in the analysis of yogurt [J].
Christensen, J ;
Becker, EM ;
Frederiksen, CS .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2005, 75 (02) :201-208
[6]  
Croux C., 1996, COMPSTAT. Proceedings in Computational Statistics. 12th Symposium, P211
[7]   Explaining a presence of groups in analytical data in terms of original variables [J].
Daszykowski, M ;
Stanimirova, I ;
Walczak, B ;
Coomans, D .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2005, 78 (1-2) :19-29
[8]   A journey into low-dimensional spaces with autoassociative neural networks [J].
Daszykowski, M ;
Walczak, B ;
Massart, DL .
TALANTA, 2003, 59 (06) :1095-1105
[9]   ADAPTIVE BLIND SEPARATION OF INDEPENDENT SOURCES - A DEFLATION APPROACH [J].
DELFOSSE, N ;
LOUBATON, P .
SIGNAL PROCESSING, 1995, 45 (01) :59-83
[10]   SUPERVISED PATTERN-RECOGNITION - THE IDEAL METHOD [J].
DERDE, MP ;
MASSART, DL .
ANALYTICA CHIMICA ACTA, 1986, 191 :1-16