共 5 条
Data Size Reduction Strategy for the Classification of Breath and Air Samples Using Multicapillary Column-Ion Mobility Spectrometry
被引:26
|作者:
Szymanska, Ewa
[1
,2
]
Brodrick, Emma
[3
]
Williams, Mark
[3
]
Davies, Antony N.
[3
,4
]
van Manen, Henk-Jan
[4
]
Buydens, Lutgarde M. C.
[2
]
机构:
[1] TI COAST, NL-1098 XH Amsterdam, Netherlands
[2] Radboud Univ Nijmegen, Inst Mol & Mat, NL-6500 GL Nijmegen, Netherlands
[3] Univ South Wales, Fac Comp Engn & Sci, Sch Appl Sci, Pontypridd CF37 1DL, M Glam, Wales
[4] AkzoNobel NV, Supply Chain, Res & Dev, Strateg Res Grp Measurement & Analyt Sci, NL-7400 AA Deventer, Netherlands
关键词:
2-DIMENSIONAL WAVELET COMPRESSION;
VOLATILE ORGANIC-COMPOUNDS;
SPECTRA;
D O I:
10.1021/ac503857y
中图分类号:
O65 [分析化学];
学科分类号:
070302 ;
081704 ;
摘要:
Ion mobility spectrometry combined with multicapillary column separation (MCC-IMS) is a well-known technology for detecting volatile organic compounds (VOCs) in gaseous samples. Due to their large data size, processing of MCC-IMS spectra is still the main bottleneck of data analysis, and there is an increasing need for data analysis strategies in which the size of MCC-IMS data is reduced to enable further analysis. In our study, the first untargeted chemometric strategy is developed and employed in the analysis of MCC-IMS spectra from 264 breath and ambient air samples. This strategy does not comprise identification of compounds as a primary step but includes several preprocessing steps and a discriminant analysis. Data size is significantly reduced in three steps. Wavelet transform, mask construction, and sparse-partial least squares-discriminant analysis (s-PLS-DA) allow data size reduction with down to 50 variables relevant to the goal of analysis. The influence and compatibility of the data reduction tools are studied by applying different settings of the developed strategy. Loss of information after preprocessing is evaluated, e.g., by comparing the performance of classification models for different classes of samples. Finally, the interpretability of the classification models is evaluated, and regions of spectra that are related to the identification of potential analytical biomarkers are successfully determined. This work will greatly enable the standardization of analytical procedures across different instrumentation types promoting the adoption of MCC-IMS technology in a wide range of diverse application fields.
引用
收藏
页码:869 / 875
页数:7
相关论文