PsyPhy: A Psychophysics Driven Evaluation Framework for Visual Recognition

被引:38
作者
RichardWebster, Brandon [1 ]
Anthony, Samuel E. [2 ,3 ]
Scheirer, Walter J. [1 ]
机构
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
[2] Harvard Univ, Dept Psychol, 33 Kirkland St, Cambridge, MA 02138 USA
[3] Percept Automata Inc, Somerville, MA 02143 USA
基金
美国国家科学基金会;
关键词
Object recognition; visual psychophysics; neuroscience; psychology; evaluation; deep learning; HIERARCHICAL-MODELS; PERFORMANCE;
D O I
10.1109/TPAMI.2018.2849989
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
By providing substantial amounts of data and standardized evaluation protocols, datasets in computer vision have helped fuel advances across all areas of visual recognition. But even in light of breakthrough results on recent benchmarks, it is still fair to ask if our recognition algorithms are doing as well as we think they are. The vision sciences at large make use of a very different evaluation regime known as Visual Psychophysics to study visual perception. Psychophysics is the quantitative examination of the relationships between controlled stimuli and the behavioral responses they elicit in experimental test subjects. Instead of using summary statistics to gauge performance, psychophysics directs us to construct item-response curves made up of individual stimulus responses to find perceptual thresholds, thus allowing one to identify the exact point at which a subject can no longer reliably recognize the stimulus class. In this article, we introduce a comprehensive evaluation framework for visual recognition models that is underpinned by this methodology. Over millions of procedurally rendered 3D scenes and 2D images, we compare the performance of well-known convolutional neural networks. Our results bring into question recent claims of human-like performance, and provide a path forward for correcting newly surfaced algorithmic deficiencies.
引用
收藏
页码:2280 / 2286
页数:7
相关论文
共 42 条
[1]  
Nguyen A, 2015, PROC CVPR IEEE, P427, DOI 10.1109/CVPR.2015.7298640
[2]  
[Anonymous], ARXIV170606969
[3]  
[Anonymous], P 3 INT C LEARNING R
[4]  
[Anonymous], P INT C LEARN REPR
[5]  
[Anonymous], AIM1682 MIT
[6]  
[Anonymous], 2016, 2016 IEEE WINT C APP
[7]  
[Anonymous], IMAGETNET LARGE SCAL
[8]   Towards Open Set Deep Networks [J].
Bendale, Abhijit ;
Boult, Terrance E. .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1563-1572
[9]   What object attributes determine canonical views? [J].
Blanz, V ;
Tarr, MJ ;
Bülthoff, HH .
PERCEPTION, 1999, 28 (05) :575-599
[10]   VISUAL PIGMENTS OF RODS AND CONES IN A HUMAN RETINA [J].
BOWMAKER, JK ;
DARTNALL, HJA .
JOURNAL OF PHYSIOLOGY-LONDON, 1980, 298 (JAN) :501-511