Psychometric and Machine Learning Approaches for Diagnostic Assessment and Tests of Individual Classification

被引：20

作者：

Gonzalez, Oscar ^{[1
]}

机构：

[1] Univ N Carolina, Chapel Hill, NC 27599 USA

来源：

PSYCHOLOGICAL METHODS | 2021年 / 26卷 / 02期

关键词：

psychometrics; machine learning; classification; diagnostic assessment; item response theory; OPERATING CHARACTERISTIC ANALYSIS; REGRESSION TREES; SELECTION; PACKAGE; SCALE; TOOL;

D O I：

10.1037/met0000317

中图分类号：

B84 [心理学];

学科分类号：

04 ; 0402 ;

摘要：

Assessments are commonly used to make a decision about an individual, such as grade placement, treatment assignment, job selection, or to inform a diagnosis. A psychometric approach to classify respondents based on the assessment would aggregate items into a score, and then each respondent's score is compared to a cut score. In contrast, a machine learning approach to classify respondents would build a model to predict the probability of belonging to a specific class from assessment items, and then respondents are classified based on their predicted probability of belonging to that class. It remains unclear whether psychometric and machine learning methods have comparable classification accuracy or if 1 method is preferable in all or some situations. In the context of diagnostic assessment, this study used Monte Carlo simulation methods to compare the classification accuracy of psychometric and machine learning methods as a function of the diagnosis-test correlation, prevalence, sample size, and the structure of the diagnostic assessment. Results suggest that machine learning models using logistic regression or random forest could have comparable classification accuracy to the psychometric methods using estimated item response theory scores. Therefore, machine learning models could provide a viable alternative for classification when psychometric methods are not feasible. Methods are illustrated with an empirical example predicting an oppositional defiant disorder diagnosis from a behavior disorders scale in children of age seven. Strengths and limitations for each of the methods are examined, and the overlap between the field of machine learning and psychometrics is discussed. Translational Abstract Assessments and tests are often used to make decisions about an individual, such as deciding who will graduate from high school, who gets hired for a job, and who is referred to more services or treatment. In these tests or assessments, individuals are asked to respond to a series of items, and depending on their item responses, a decision about the individual is made. Traditionally, researchers and assessment specialists have used a psychometric approach to score item responses and then check if the score is above a cut score. However, data-driven, exploratory methods from the area of machine learning could be used for similar purposes. For example, machine learning methods could be used to predict the assessment decision directly from the item responses, without the need to aggregate item responses beforehand. It remains unclear if the classification accuracy of machine learning methods is comparable with the psychometric methods typically used for classification. This study used Monte Carlo simulation methods to study classification accuracy of psychometric and machine learning methods across a variety of assessment conditions, among which are the number of items and response categories of the assessment, the strength of the relationship between the diagnosis and the assessment, sample size, and diagnosis prevalence. The methods were also illustrated in an empirical example predicting an oppositional defiant disorder diagnosis in children of age 7. Strengths and limitations for each of the methods are examined, and the overlap between the field of machine learning and psychometrics is discussed.

引用

页码：236 / 254

页数：19

共 54 条

[1] Achenbach T., 2013, ENCY AUTISM SPECTRUM, P31, DOI [10.1007/978-1-4419-1698-3219, DOI 10.1007/978-1-4419-1698-3_219]
[2] [Anonymous], 2009, The theory and practice of item response theory
[3] Birnbaum A, 1968, STAT THEORIES MENTAL
[4] MARGINAL MAXIMUM-LIKELIHOOD ESTIMATION OF ITEM PARAMETERS - APPLICATION OF AN EM ALGORITHM
BOCK, RD
AITKIN, M
[J]. PSYCHOMETRIKA, 1981, 46 (04) : 443 - 459
[5] Random forests
Breiman, L
[J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
[6] Breiman L., 1984, STAT PROBABILITY SER, DOI 10.1201/9781315139470
[7] A Few Useful Things to Know About Machine Learning
Domingos, Pedro
[J]. COMMUNICATIONS OF THE ACM, 2012, 55 (10) : 78 - 87
[8] Applying item response theory (IRT) modeling to questionnaire development, evaluation, and refinement
Edelen, Maria Orlando
Reeve, Bryce B.
[J]. QUALITY OF LIFE RESEARCH, 2007, 16 (Suppl 1) : 5 - 18
[9] A Diagnostic Procedure to Detect Departures From Local Independence in Item Response Theory Models
Edwards, Michael C.
Houts, Carrie R.
Cai, Li
[J]. PSYCHOLOGICAL METHODS, 2018, 23 (01) : 138 - 149
[10] Egan J. P., 1975, Signal detection theory and ROC-analysis

← 1 2 3 4 5 6 →