Predicting User Confidence During Visual Decision Making

Cited by: 9
Authors
Smith, Jim [1]
Legg, Phil [1]
Matovic, Milos [1]
Kinsey, Kristofer [2]
Affiliations
[1] Univ West England, Dept Comp Sci & Creat Technol, Bristol BS16 1QY, Avon, England
[2] Univ West England, Dept Psychol, Bristol BS16 1QY, Avon, England
Keywords
Human-centred machine learning; confidence; uncertainty
DOI
10.1145/3185524
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
People are not infallible, consistent "oracles": their confidence in decision-making may vary significantly between tasks and over time. We have previously reported the benefits of using an interface and algorithms that explicitly captured and exploited users' confidence: error rates were reduced by up to 50% for an industrial multi-class learning problem, and the number of interactions required in a design-optimisation context was reduced by 33%. Having access to users' confidence judgements could significantly benefit intelligent interactive systems in industry, in areas such as intelligent tutoring systems, and in health care. There are many reasons for wanting to capture information about confidence implicitly. Some are ergonomic, but others are more "social", such as wishing to understand (and possibly take account of) users' cognitive state without interrupting them. We investigate the hypothesis that users' confidence can be accurately predicted from measurements of their behaviour. Eye-tracking systems were used to capture users' gaze patterns as they undertook a series of visual decision tasks, after each of which they reported their confidence on a 5-point Likert scale. Subsequently, predictive models were built using "conventional" machine learning approaches for numerical summary features derived from users' behaviour. We also investigate the extent to which the deep learning paradigm can reduce the need to design features specific to each application by creating "gaze maps" (visual representations of the trajectories and durations of users' gaze fixations) and then training deep convolutional networks on these images. Treating the prediction of user confidence as a two-class problem (confident/not confident), we attained classification accuracy of 88% for the scenario of new users on known tasks, and 87% for known users on new tasks.
Considering the confidence as an ordinal variable, we produced regression models with a mean absolute error (MAE) of approximately 0.7 in both cases. Capturing just a simple subset of non-task-specific numerical features gave slightly worse but still quite high accuracy (e.g., MAE of approximately 1.0). Results obtained with gaze maps and convolutional networks are competitive, despite not having access to longer-term information about users and tasks, which was vital for the "summary" feature sets. This suggests that the gaze-map-based approach forms a viable, transferable alternative to handcrafting features for each different application. These results provide significant evidence to confirm our hypothesis, and offer a way of substantially improving many interactive artificial intelligence applications via the addition of cheap non-intrusive hardware and computationally cheap prediction algorithms.
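The abstract describes gaze maps only as visual representations of the trajectories and durations of gaze fixations; it does not say how they were rendered. As a minimal sketch of that idea, and not the authors' method, the following rasterises a list of fixations into a small greyscale grid. The function name `render_gaze_map`, the Gaussian blob per fixation, the `sigma` parameter, the fixed trail intensity of 0.2, and the normalised `(x, y, duration)` input format are all assumptions made for illustration.

```python
import math


def render_gaze_map(fixations, width=33, height=33, sigma=2.0):
    """Rasterise (x, y, duration) fixations into a height x width grid.

    Assumption: x and y are normalised to [0, 1]; out-of-range values
    are clamped. Returns a list of rows with intensities in [0, 1].
    """
    def to_pixel(value, extent):
        return min(max(value, 0.0), 1.0) * (extent - 1)

    points = [(to_pixel(x, width), to_pixel(y, height), d)
              for x, y, d in fixations]
    image = [[0.0] * width for _ in range(height)]

    # Trajectory: a faint straight segment between consecutive fixations.
    for (x0, y0, _), (x1, y1, _) in zip(points, points[1:]):
        steps = int(max(abs(x1 - x0), abs(y1 - y0))) + 1
        for i in range(steps + 1):
            t = i / steps
            col = round(x0 + t * (x1 - x0))
            row = round(y0 + t * (y1 - y0))
            image[row][col] = max(image[row][col], 0.2)

    # Duration: one Gaussian blob per fixation, weighted by its share
    # of the total dwell time (hypothetical encoding choice).
    total = sum(d for _, _, d in points) or 1.0
    for px, py, d in points:
        for row in range(height):
            for col in range(width):
                dist2 = (col - px) ** 2 + (row - py) ** 2
                image[row][col] += (d / total) * math.exp(
                    -dist2 / (2.0 * sigma ** 2))

    # Scale so the brightest pixel is 1.0 (an all-zero map is left as is).
    peak = max(max(r) for r in image)
    if peak > 0:
        image = [[v / peak for v in r] for r in image]
    return image


# Two fixations: a short one, then a longer one further right and down.
gaze_map = render_gaze_map([(0.25, 0.25, 0.2), (0.75, 0.5, 0.6)])
```

A grid like this could then be passed to an image classifier as a single-channel input. Encoding duration as brightness means longer fixations dominate the map; other encodings (blob radius, colour) are equally plausible readings of the abstract.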
Pages: 30