Review and application of functional data analysis to chemical data-The example of the comparison, classification, and database search of forensic ink chromatograms

被引:22
作者
Burfield, Riley [1 ]
Neumann, Cedric [1 ]
Saunders, Christopher P. [1 ]
机构
[1] S Dakota State Univ, Dept Math & Stat, Brookings, SD 57007 USA
关键词
Functional data analysis; Chromatography; Spectroscopy; Classification; Comparison; Spectra;
D O I
10.1016/j.chemolab.2015.07.006
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Functional data analysis is a relatively recent statistical method that can be applied to any dataset that can be thought of as a function. Functional data analysis considers functions as random elements. Modem chromatographic or spectroscopic techniques typically record analytical outputs as a function of time or wavelength. The purpose of this paper is to investigate the potential of functional data analysis for the characterization, comparison, and classification of chemical data. Forensic examination of ink is used as the main example in this paper as it covers different aspects of functional data analysis: (a) thin-layer chromatograms resulting from analysis of ink samples are characterized as functions of time and wavelength; (b) multiple samples analyzed at different times, or by different analysts, are registered into a common space; (c) a dimension reduction technique is applied to the sample functions to enable (d) their use for comparing between ink samples and for clustering large databases of inks. Our algorithms showed excellent performance and can readily be implemented to search and retrieve chemical profiles in large databases. From a theoretical standpoint, functional data analysis allows for a natural extension of multivariate analysis to datasets that can be thought of as functions. Algorithmically, functional data analysis proves to be a powerful technique that enables to detect functions minima and maxima, register multiple functions to a common space, and control the dimensionality and smoothness of a functional dataset Nevertheless, we found that the implementation of functional data analysis is computationally complex when compared to classic multivariate analysis. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:97 / 106
页数:10
相关论文
共 20 条
[1]  
Aguilera A.M., 2013, Open Journal of Statistics, V3, P334, DOI DOI 10.4236/0JS.2013.35039
[2]  
[Anonymous], JOINT ANAL FO SPEECH
[3]  
[Anonymous], 2011, Functional data analysis for phonetic research
[4]  
[Anonymous], PRINCIPLE COMPONENT
[5]  
[Anonymous], FORENSIC SCI INT
[6]  
[Anonymous], STAT ECONOMETRICS WO
[7]  
Genz A., MVTNORM MULTIVARIATE
[8]   Analysis of non-stationary dynamics in the financial system [J].
Guharay, Samar K. ;
Thakur, Gaurav S. ;
Goodman, Fred J. ;
Rosen, Scott L. ;
Houser, Daniel .
ECONOMICS LETTERS, 2013, 121 (03) :454-457
[9]   The Classification of Inkjet Inks Using AccuTOF DART (Direct Analysis in Real Time) Mass SpectrometryA Preliminary Study [J].
Houlgrave, Stephanie ;
LaPorte, Gerald M. ;
Stephens, Joseph C. ;
Wilson, Justin L. .
JOURNAL OF FORENSIC SCIENCES, 2013, 58 (03) :813-821
[10]   Forensic examination of ink by high-performance thin layer chromatography-The United States Secret Service Digital Ink Library [J].
Neumann, Cedric ;
Ramotowski, Robert ;
Genessay, Thibault .
JOURNAL OF CHROMATOGRAPHY A, 2011, 1218 (19) :2793-2811