Partial Least Squares tutorial for analyzing neuroimaging data

被引:22
作者
Van Roon, Patricia [1 ,2 ]
Zakizadeh, Jila [1 ,2 ]
Chartier, Sylvain [2 ]
机构
[1] Carleton Univ, Sch Psychol, Ottawa, ON, Canada
[2] Univ Ottawa, Sch Psychol, Ottawa, ON, Canada
来源
QUANTITATIVE METHODS FOR PSYCHOLOGY | 2014年 / 10卷 / 02期
关键词
partial least squares; PLS; regression; correlation; Mathematica; NIPALS;
D O I
10.20982/tqmp.10.2.p200
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
Partial least squares (PLS) has become a respected and meaningful soft modeling analysis technique that can be applied to very large datasets where the number of factors or variables is greater than the number of observations. Current biometric studies (e.g., eye movements, EKG, body movements, EEG) are often of this nature. PLS eliminates the multiple linear regression issues of over-fitting data by finding a few underlying or latent variables (factors) that account for most of the variation in the data. In real-world applications, where linear models do not always apply, PLS can model the non-linear relationship well. This tutorial introduces two PLS methods, PLS Correlation (PLSC) and PLS Regression (PLSR) and their applications in data analysis which are illustrated with neuroimaging examples. Both methods provide straightforward and comprehensible techniques for determining and modeling relationships between two multivariate data blocks by finding latent variables that best describe the relationships. In the examples, the PLSC will analyze the relationship between neuroimaging data such as Event-Related Potential (ERP) amplitude averages from different locations on the scalp with their corresponding behavioural data. Using the same data, the PLSR will be used to model the relationship between neuroimaging and behavioural data. This model will be able to predict future behaviour solely from available neuroimaging data. To find latent variables, Singular Value Decomposition (SVD) for PLSC and Non-linear Iterative PArtial Least Squares (NIPALS) for PLSR are implemented in this tutorial. SVD decomposes the large data block into three manageable matrices containing a diagonal set of singular values, as well as left and right singular vectors. For PLSR, NIPALS algorithms are used because they provide a more precise estimation of the latent variables. Mathematica notebooks are provided for each PLS method with clearly labeled sections and subsections. The notebook examples show the entire process and the results are reported in the Section 3 Examples.
引用
收藏
页码:200 / 215
页数:16
相关论文
共 20 条
  • [1] Abdi Herve, 2013, Methods Mol Biol, V930, P549, DOI 10.1007/978-1-62703-059-5_23
  • [2] Partial least squares regression and projection on latent structure regression (PLS Regression)
    Abdi, Herve
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (01): : 97 - 106
  • [3] Beck M., 2010, ART PROOF UNDERGRADU, P7023, DOI [10.1007/978-1-4419-7023-7, DOI 10.1007/978-1-4419-7023-7]
  • [4] Bona M., 2011, WALK COMBINATORICS, P568
  • [5] Brown JD., 2019, JALT TESTING EVALUAT, V13, P26
  • [6] Ghazy R. A., 2008, PROGR ELECTROMAGNETI, V8, P147
  • [7] Haenlein M., 2004, UNDERST STAT, V3, P283, DOI [DOI 10.1207/S15328031US0304_4, 10.1207/s15328031us0304_4]
  • [8] Haykin S., 1991, ADAPTIVE FILTER THEO
  • [9] Henning G., 1989, LANG TEST, V6, P95, DOI [10.1177/026553228900600108, DOI 10.1177/026553228900600108]
  • [10] Partial Least Squares (PLS) methods for neuroimaging: A tutorial and review
    Krishnan, Anjali
    Williams, Lynne J.
    McIntosh, Anthony Randal
    Abdi, Herve
    [J]. NEUROIMAGE, 2011, 56 (02) : 455 - 475