Impact of data bin size on the classification of diesel fuels using comprehensive two-dimensional gas chromatography with principal component analysis

被引:24
作者
Sudol, Paige E. [1 ]
Gough, Derrick V. [1 ]
Prebihalo, Sarah E. [1 ]
Synovec, Robert E. [1 ]
机构
[1] Univ Washington, Dept Chem, Box 351700, Seattle, WA 98195 USA
关键词
Comprehensive two-dimensional gas; chromatography; Data binning; Principal component analysis; Degree-of-class separation; Diesel classification; CHEMOMETRIC ANALYSIS; MASS SPECTROMETRY; QUANTITATIVE-ANALYSIS; CONVENTIONAL DIESEL; PEAK-CAPACITY; METABOLOMICS; SEPARATION; REDUCTION; BIODIESEL; ALIGNMENT;
D O I
10.1016/j.talanta.2019.120239
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Principal component analysis (PCA) is a widely applied chemometric tool for classifying samples using comprehensive two-dimensional (2D) gas chromatography (GC x GC) separation data. Classification via PCA can be improved by 2D binning of the data. A "standard operating procedure (SOP) bin size" is often applied to improve the S/N and to mitigate potential retention time misalignment issues. The SOP bin size is generally selected to be slightly larger than the typical 2D peak dimensions. In this study we examine to what extent a single SOP bin size is optimal for all of the class comparisons that can be made in a single PCA scores plot. For this purpose, a GC x GC-FID dataset comprised of 5 different diesel fuels (i.e., 5 sample classes), each run with 4 replicates using a reverse column configuration (polar ID column and non-polar 2D column) was utilized. The dataset was collected within about one day, which minimized retention time misalignment in order to allow the study to focus on S/N enhancement concurrent with maintaining the chemical selectivity provided by the GC x GC separations. A total of 110 bin sizes were evaluated. Degree-of-class separation (DCS) was utilized as a quantitative metric to assess the impact of binning in improving separation in the scores plot. The DCS was calculated pair-wise between nearest neighbor sample classes for each of the 5 sample classes in the scores plot (5 sample class pairs). Results indicated the SOP bin size did not provide the highest DCS for any of the 5 fuel pairs. Each fuel pair is found to have its own optimal bin size, suggesting the binning finds the balance between S/N optimization concurrent with leveraging the chemical selectivity information differences in the samples as manifested in their GC x GC separation "patterns". Robustness of the findings in this study were supported by leaving out one fuel at a time and re-running the PCA models.
引用
收藏
页数:9
相关论文
共 38 条
  • [1] Investigating weathering in light diesel oils using comprehensive two-dimensional gas chromatography-High resolution mass for spectrometry and pixel-based analysis: Possibilities and limitations
    Alexandrino, Guilherme L.
    Malmborg, Jonas
    Augusto, Fabio
    Christensen, Jan H.
    [J]. JOURNAL OF CHROMATOGRAPHY A, 2019, 1591 : 155 - 161
  • [2] Methods for one- and two-dimensional gas chromatography with flame ionization detection for identification of Mycobacterium tuberculosis in sputum
    Arslan, Fatma Nur
    Kolk, Arend
    Janssen, Hans-Gerd
    [J]. JOURNAL OF CHROMATOGRAPHY B-ANALYTICAL TECHNOLOGIES IN THE BIOMEDICAL AND LIFE SCIENCES, 2019, 1124 : 204 - 217
  • [3] Ultrafast separations via pulse flow valve modulation to enable high peak capacity multidimensional gas chromatography
    Bahaghighat, H. Daniel
    Freye, Chris E.
    Gough, Derrick V.
    Sudol, Paige E.
    Synovec, Robert E.
    [J]. JOURNAL OF CHROMATOGRAPHY A, 2018, 1573 : 115 - 124
  • [4] Improving the quality of biomarker candidates in untargeted metabolomics via peak table-based alignment of comprehensive two-dimensional gas chromatography-mass spectrometry data
    Bean, Heather D.
    Hill, Jane E.
    Dimandja, Jean-Marie D.
    [J]. JOURNAL OF CHROMATOGRAPHY A, 2015, 1394 : 111 - 117
  • [5] Construction and evaluation of confidence ellipses applied at sensory data
    Cadoret, Marine
    Husson, Francois
    [J]. FOOD QUALITY AND PREFERENCE, 2013, 28 (01) : 106 - 115
  • [6] Visualizing Big data with Compressed Score Plots: Approach and research challenges
    Camacho, Jose
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2014, 135 : 110 - 125
  • [7] Unsupervised classification of petroleum Certified Reference Materials and other fuels by chemometric analysis of gas chromatography-mass spectrometry data
    de Carvalho Rocha, Werickson Fortunato
    Schantz, Michele M.
    Sheen, David A.
    Chu, Pamela M.
    Lippa, Katrice A.
    [J]. FUEL, 2017, 197 : 248 - 258
  • [8] The Mahalanobis distance
    De Maesschalck, R
    Jouan-Rimbaud, D
    Massart, DL
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2000, 50 (01) : 1 - 18
  • [9] Use of cluster separation indices and the influence of outliers: application of two new separation indices, the modified silhouette index and the overlap coefficient to simulated data and mouse urine metabolomic profiles
    Dixon, Sarah J.
    Heinrich, Nina
    Holmboe, Maria
    Schaefer, Michele L.
    Reed, Randall R.
    Trevejo, Jose
    Brereton, Richard G.
    [J]. JOURNAL OF CHEMOMETRICS, 2009, 23 (1-2) : 19 - 31
  • [10] Partial least squares analysis of rocket propulsion fuel data using diaphragm valve-based comprehensive two-dimensional gas chromatography coupled with flame ionization detection
    Freye, Chris E.
    Fitz, Brian D.
    Billingsley, Matthew C.
    Synovec, Robert E.
    [J]. TALANTA, 2016, 153 : 203 - 210