Item desirability matching in forced-choice test construction

被引:23
作者
Pavlov, Goran [1 ,2 ]
Shi, Dexin [1 ]
Maydeu-Olivares, Alberto [1 ,2 ]
Fairchild, Amanda [1 ]
机构
[1] Univ South Carolina, Dept Psychol, 1512 Pendleton St, Columbia, SC 29208 USA
[2] Univ Barcelona, Fac Psychol, Barcelona, Spain
关键词
Forced-choice; Desirability matching; Socially desirable responding; Faking; Inter-item agreement; INTERRATER RELIABILITY; SOCIAL DESIRABILITY; PAIRWISE-PREFERENCE; WEIGHTED KAPPA; PERSONALITY TESTS; AGREEMENT; FAKING; SELECTION; RATINGS; SCALE;
D O I
10.1016/j.paid.2021.111114
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
The forced-choice method has been proposed as a viable strategy to prevent socially desirable responding (SDR) on self-report non-cognitive measures. The ability of the method to eliminate SDR stems from matching items that are perceived as equally desirable into forced-choice item-blocks. The gold standard in quantifying similarity between items in terms of desirability has been the "mean difference index", that is, the absolute difference between items' mean desirability ratings. This index relies on the assumption that items have one true desirability value, as efficiently and unbiasedly estimated by their respective means, and may fail if this assumption does not hold. To circumvent this issue, we propose indexing similarity between items in terms of desirability with several robust measures of absolute agreement (i.e., inter-item agreement indices). Using an empirical example, we show that relying on the mean difference index may lead to suboptimal forced-choice item-block assembly by matching items with a relatively poor inter-item agreement with respect to desirability. R code for computing the proposed agreement indices on a set of desirability ratings is provided, as are recommendations for applied researchers.
引用
收藏
页数:10
相关论文
共 72 条
[1]   Development of a Forced-Choice Measure of Typical-Performance Emotional Intelligence [J].
Anguiano-Carrasco, Cristina ;
MacCann, Carolyn ;
Geiger, Mattis ;
Seybert, Jacob M. ;
Roberts, Richard D. .
JOURNAL OF PSYCHOEDUCATIONAL ASSESSMENT, 2015, 33 (01) :83-97
[2]  
[Anonymous], 1981, STAT METHODS RATES P
[3]   SOCIAL DESIRABILITY RESPONSE DIFFERENCES UNDER RESEARCH, SIMULATED SELECTION, AND FAKING INSTRUCTIONAL SETS [J].
BARTLETT, CJ ;
DOORLEY, R .
PERSONNEL PSYCHOLOGY, 1967, 20 (03) :281-288
[4]   FAKING BY SALES APPLICANTS OF A FORCED CHOICE PERSONALITY-INVENTORY [J].
BASS, BM .
JOURNAL OF APPLIED PSYCHOLOGY, 1957, 41 (06) :403-404
[5]   COEFFICIENT KAPPA - SOME USES, MISUSES, AND ALTERNATIVES [J].
BRENNAN, RL ;
PREDIGER, DJ .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1981, 41 (03) :687-699
[6]   Item Response Modeling of Forced-Choice Questionnaires [J].
Brown, Anna ;
Maydeu-Olivares, Alberto .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2011, 71 (03) :460-502
[7]   Interrater agreement reconsidered:: An alternative to the rwg indices [J].
Brown, RD ;
Hauenstein, NMA .
ORGANIZATIONAL RESEARCH METHODS, 2005, 8 (02) :165-184
[8]   On the Statistical and Practical Limitations of Thurstonian IRT Models [J].
Buerkner, Paul-Christian ;
Schulte, Niklas ;
Holling, Heinz .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2019, 79 (05) :827-854
[9]   On Average Deviation Indices for Estimating Interrater Agreement [J].
Burke, Michael J. ;
Finkelstein, Lisa M. ;
Dusig, Michelle S. .
ORGANIZATIONAL RESEARCH METHODS, 1999, 2 (01) :49-68
[10]   Normative Scoring of Multidimensional Pairwise Preference Personality Scales Using IRT: Empirical Comparisons With Other Formats [J].
Chernyshenko, Oleksandr S. ;
Stark, Stephen ;
Prewett, Matthew S. ;
Gray, Ashley A. ;
Stilson, Frederick R. ;
Tuttle, Matthew D. .
HUMAN PERFORMANCE, 2009, 22 (02) :105-127