The Mathematical Meaninglessness of the NASA Task Load Index: A Level of Measurement Analysis

被引:14
作者
Bolton, Matthew L. [1 ]
Biltekoff, Elliot [2 ]
Humphrey, Laura [3 ]
机构
[1] Univ Virginia, Dept Syst & Informat Engn, Charlottesville, VA 22903 USA
[2] SUNY Buffalo, Dept Syst & Ind Engn, Buffalo, NY 14260 USA
[3] USAF, Res Lab, Wright Patterson AFB, OH 45433 USA
关键词
Particle measurements; Atmospheric measurements; Task analysis; Psychology; Correlation; Standards; Power measurement; Human performance assessment; psychometrics and testing; Index Terms; workload; SUBJECTIVE MENTAL WORKLOAD; SCIENCE; TLX;
D O I
10.1109/THMS.2023.3263482
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human mental workload can profoundly impact human performance and is thus an important consideration in the design and operation of many systems. The standard method for assessing human mental workload is the NASA Task Load Index (NASA-TLX). This involves a human operator subjectively rating a task based on six dimensions. These dimensions are combined into a single workload score using one of two methods: scaling and summing the dimensions (where scales are derived from a paired comparisons procedure) or averaging dimensions together. Despite its widespread use, the level of measurement of NASA-TLX's dimensions and its computed workload score has not been investigated. Additionally, nobody has researched whether NASA-TLX's two approaches for computing overall workload are mathematically meaningful with respect to the constituent dimensions' levels of measurement. This is a serious deficiency. Knowing what the level of measurement is for NASA-TLX scores will determine what mathematics can be meaningfully applied to them. Furthermore, if NASA-TLX workload syntheses are mathematically meaningless, then the measure lacks construct validity. The research presented in this article used a previously developed method to evaluate the level of measurement of NASA-TLX workload and its dimensions. Results show that the dimensions can, in most situations, be treated as interval in population analyses and ordinal for individuals. Our results also suggest that the methods for combining dimensions into workload scores are meaningless. We recommend that analysts evaluate the dimensions of NASA-TLX without combining them.
引用
收藏
页码:590 / 599
页数:10
相关论文
共 48 条
[1]   Subjective rating scales: science or art? [J].
Annett, J .
ERGONOMICS, 2002, 45 (14) :966-987
[2]  
[Anonymous], 2018, Proceedings of the Human Factors and Ergonomics Society Annual Meeting, V62, P44, DOI 10.1177/1541931218621010
[3]  
[Anonymous], 1989, Advances in industrial ergonomics and safety
[4]  
Barrett P., 2003, J MANAGE PSYCHOL, V18, P421, DOI [10.1108/02683940310484026, DOI 10.1108/02683940310484026]
[5]  
Bolton Matthew L., 2022, Proceedings of the Human Factors and Ergonomics Society Annual Meeting, P80, DOI 10.1177/1071181322661215
[6]  
Bolton Matthew L., 2008, 2008 IEEE International Conference on Systems, Man and Cybernetics (SMC 2008), P1073, DOI 10.1109/ICSMC.2008.4811424
[7]   The Level of Measurement of Subjective Situation Awareness and Its Dimensions in the Situation Awareness Rating Technique (SART) [J].
Bolton, Matthew L. ;
Biltekoff, Elliot ;
Humphrey, Laura .
IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2022, 52 (06) :1147-1154
[8]  
Braarud P. O., 2021, INT J IND ERGONOM, V86
[9]   NASA TLX: Software for assessing subjective mental workload [J].
Cao, Alex ;
Chintamani, Keshav K. ;
Pandya, Abhilash K. ;
Ellis, R. Darin .
BEHAVIOR RESEARCH METHODS, 2009, 41 (01) :113-117
[10]   OpenMATB: A Multi-Attribute Task Battery promoting task customization, software extensibility and experiment replicability [J].
Cegarra, J. ;
Valery, B. ;
Avril, E. ;
Calmettes, C. ;
Navarro, J. .
BEHAVIOR RESEARCH METHODS, 2020, 52 (05) :1980-1990