Discriminability and uncertainty in principal component analysis (PCA) of temporal check-all-that-apply (TCATA) data

被引:19
作者
Castura, J. C. [1 ]
Rutledge, D. N. [2 ,3 ]
Ross, C. F. [4 ]
Naes, T. [5 ,6 ]
机构
[1] Compusense Inc, 255 Speedvale Ave W, Guelph, ON N1H 1C5, Canada
[2] Univ Paris Saclay, UMR SayFood, AgroParisTech, INRAE, F-75005 Paris, France
[3] Charles Sturt Univ, Natl Wine & Grape Ind Ctr, Wagga Wagga, NSW, Australia
[4] Washington State Univ, Sch Food Sci, Pullman, WA 99164 USA
[5] Nofima AS, Osloveien 1,POB 210, N-1431 As, Norway
[6] Univ Copenhagen, Fac Sci, Dept Food Sci, Rolighetsvej 30, DK-1958 Copenhagen, Denmark
关键词
Principal component analysis (PCA); Temporal check-all-that-apply (TCATA); Bootstrap; Discriminability; Uncertainty; Wine; CONFIDENCE; BOOTSTRAP;
D O I
10.1016/j.foodqual.2021.104370
中图分类号
TS2 [食品工业];
学科分类号
0832 ;
摘要
Temporal check-all-that-apply (TCATA) data can be summarized and explored using principal component analysis (PCA). Here we analyze TCATA data on Syrah wines obtained from a trained sensory panel. We evaluate new and existing methods to explore the uncertainty in the PCA scores. To do so, we use the bootstrap procedure to obtain many virtual panels from the real panel's data. Virtual-panel PCA scores are obtained using two methods. The first method, called the partial bootstrap (PB), obtains virtual-panel scores from regression. The second method, called the truncated total bootstrap (TTB), applies PCA to the virtual-panel results to obtain scores, which are truncated and superimposed on the real-panel scores by Procrustes rotation. We use the virtual scores from each method to investigate uncertainty in the real-panel PCA scores visually and numerically. To understand the uncertainty of the scores, we obtain confidence ellipses (CEs) and their areas, as well as confidence intervals (CIs) and their widths. Next, to determine whether PCA scores for different samples are well separated, we propose a procedure for approximating the standard errors of sample differences and correcting for multiple comparisons. We propose a discriminability index, and show that it can enhance the interpretability of PCA results. We incorporate graphical features into our PCA biplots to visualize discriminability. We did not find a large difference between the PB and TTB methods for understanding the uncertainty and discriminability in PCA scores. Although the TCATA data that we analyzed have a special structure, the methodological approaches presented here can be readily adapted to other applications of PCA.
引用
收藏
页数:11
相关论文
共 32 条
  • [1] [Anonymous], 2007, APPL MULTIVARIATE ST
  • [2] Bootstrap based confidence limits in principal component analysis - A case study
    Babamoradi, Hamid
    van den Berg, Frans
    Rinnan, Asmund
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2013, 120 : 97 - 105
  • [3] Baker AK, 2020, Mendeley Data, DOI 10.17632/JV22XN66KY.2
  • [4] Temporal Check-All-That-Apply Characterization of Syrah Wine
    Baker, Allison K.
    Castura, John C.
    Ross, Carolyn F.
    [J]. JOURNAL OF FOOD SCIENCE, 2016, 81 (06) : S1521 - S1529
  • [5] Bivand R. S., 2013, Applied Spatial Data Analysis with R, V2nd, DOI 10.1007/978-1-4614-7618-4
  • [6] Borcard D, 2011, USE R, P1, DOI 10.1007/978-1-4419-7976-6
  • [7] Brito P., 2007, Selected Contributions in Data Analysis and Classification, P581, DOI DOI 10.1007/978-3-540-73560-1_55
  • [8] Construction and evaluation of confidence ellipses applied at sensory data
    Cadoret, Marine
    Husson, Francois
    [J]. FOOD QUALITY AND PREFERENCE, 2013, 28 (01) : 106 - 115
  • [9] Castura J.C., 2020, TEMPR TEMPORAL SENSO
  • [10] Using contrails contrails and animated sequences to visualize uncertainty in dynamic sensory profiles obtained from temporal check-all-that-apply (TCATA) data
    Castura, John C.
    Baker, Allison K.
    Ross, Carolyn F.
    [J]. FOOD QUALITY AND PREFERENCE, 2016, 54 : 90 - 100