Directional nature of the product-moment correlation coefficient and some consequences

被引:3
作者
Metsamuuronen, Jari [1 ,2 ]
机构
[1] Finnish Educ Evaluat Ctr FINEEC, Helsinki, Finland
[2] Univ Turku, Turku Res Inst Learning Analyt, Turku, Finland
关键词
product-moment correlation coefficient; coefficient eta; directional coefficient; eta squared; Goodman-Kruskal G; Somers D; RANGE RESTRICTION; MATHEMATICAL CONTRIBUTIONS; POLYSERIAL CORRELATION; CAUTIONARY NOTE; EFFECT SIZE; BISERIAL R; SOMERS-D; ASSOCIATION; STATISTICS; SELECTION;
D O I
10.3389/fpsyg.2022.988660
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Product-moment correlation coefficient (PMC) is usually taken as a symmetric measure of the association because it produces an equal estimate irrespective of how two variables in the analysis are declared. However, in case the other variable has or both have non-continuous scales and when the scales of the variables differ from each other, PMC is unambiguously a directional measure directed so that the variable with a wider scale (X) explains the order or response pattern in the variable with a narrower scale (g) and not in the opposite direction or symmetrically. If the scales of the variables differ from each other, PMC is also prone to give a radical underestimation of the association, that is, the estimates are deflated. Both phenomena have obvious consequences when it comes to interpreting and speaking of the results. Empirical evidence shows that the effect of directionality increases by the discrepancy of the number of categories of the variables of interest. In the measurement modelling setting, if the scale of the score variable is four times wider than the scale of the item, the directionality is notable: score explains the order in the item and no other way around nor symmetrically. This is regarded as a positive and logical direction from the test theory viewpoint. However, the estimate of association may be radically deflated, specifically, if the item has an extremely difficult level. Whenever the statistic r(2) or R-2 is used, as is usual in general scatterplots or when willing to express the explaining power of the variables, this statistic is always a directional measure, and the estimate is an underestimate if the scales differ from each other; this should be kept in mind when interpreting r-squared statistics as well as with the related statistic eta squared within general linear modelling.
引用
收藏
页数:19
相关论文
共 90 条
[1]  
[Anonymous], 1989, Stat. Sci.
[2]  
[Anonymous], 1988, Non- parametric statistics for the behavioral sciences
[3]   THE CORRELATION RATIO [J].
Ayres, Leonard P. .
JOURNAL OF EDUCATIONAL RESEARCH, 1920, 2 (01) :452-456
[4]  
Biggs D., 1991, Journal of Applied Statistics, V18, P49, DOI DOI 10.1080/02664769100000005
[5]  
Bravais A., 1844, Mem. Acad. Roy. Sei. Inst. France, Sci. Math, et Phys., V9, P255
[6]  
Byrne B, 2010, INTERNATIONAL HANDBOOK OF PSYCHOLOGY IN EDUCATION, P3
[7]  
Camp B.H., 1933, J AM STAT ASSOC, V28, P395, DOI [10.1080/01621459.1933.10503239, DOI 10.1080/01621459.1933.10503239]
[8]  
Chan D, 2009, STATISTICAL AND METHODOLOGICAL MYTHS AND URBAN LEGENDS: DOCTRINE, VERITY AND FABLE IN THE ORGANIZATIONAL AND SOCIAL SCIENCES, P309
[9]   A fast algorithm for computing distance correlation [J].
Chaudhuri, Arin ;
Hu, Wenhao .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2019, 135 :15-24
[10]  
Cleff T., 2019, Applied statistics and multivariate data analysis for business and economics, DOI [DOI 10.1007/978-3-030-17767-61, 10.1007/978-3-030-17767-6, DOI 10.1007/978-3-030-17767-6]