A Meta-Analysis of the Reliability of Second Language Listening Tests (1991-2022)

Cited by: 1
Authors
Shang, Yuxin [1]
Aryadoust, Vahid [1]
Hou, Zhuohan [2]
Affiliations
[1] Nanyang Technol Univ, Natl Inst Educ, Singapore 639798, Singapore
[2] Zhejiang Univ, Sch Int Studies, Hangzhou 310058, Peoples R China
Keywords
listening assessment; meta-analysis; moderator analysis; reliability generalization; validity arguments; SCORE RELIABILITY; INTERNAL CONSISTENCY; COEFFICIENTS; STANDARD; ALPHA; BIAS;
DOI
10.3390/brainsci14080746
Chinese Library Classification (CLC) number
Q189 [Neuroscience];
Discipline classification code
071006;
Abstract
To investigate the reliability of L2 listening tests and explore factors that may affect it, the present study conducted a reliability generalization (RG) meta-analysis. A total of 122 alpha coefficients of L2 listening tests from 92 published articles were collected and submitted to a linear mixed-effects RG analysis. The papers were coded using a scheme of 16 variables in three categories: study features, test features, and statistical results. The results showed an average reliability of 0.818 (95% CI: 0.803 to 0.833), with 40% of reliability estimates falling below the lower bound of the CI. Publication bias and heterogeneity were found in the reliability of L2 listening tests, indicating that low reliability coefficients were likely omitted from some published studies. In addition, two factors predicted the reliability of L2 listening tests: the number of items and test type (standardized versus researcher- or teacher-designed tests). The study also found that reliability does not moderate the relationship between L2 listening scores and theoretically relevant constructs. Reliability induction was also identified in the reporting of reliability for L2 listening tests. Implications for researchers and teachers are discussed.
Pages: 28