PsyPhy: A Psychophysics Driven Evaluation Framework for Visual Recognition

被引：38

作者：

RichardWebster, Brandon ^{[1
]}

Anthony, Samuel E. ^{[2
,3
]}

Scheirer, Walter J. ^{[1
]}

机构：

[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA

[2] Harvard Univ, Dept Psychol, 33 Kirkland St, Cambridge, MA 02138 USA

[3] Percept Automata Inc, Somerville, MA 02143 USA

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2019年 / 41卷 / 09期

基金：

美国国家科学基金会;

关键词：

Object recognition; visual psychophysics; neuroscience; psychology; evaluation; deep learning; HIERARCHICAL-MODELS; PERFORMANCE;

D O I：

10.1109/TPAMI.2018.2849989

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

By providing substantial amounts of data and standardized evaluation protocols, datasets in computer vision have helped fuel advances across all areas of visual recognition. But even in light of breakthrough results on recent benchmarks, it is still fair to ask if our recognition algorithms are doing as well as we think they are. The vision sciences at large make use of a very different evaluation regime known as Visual Psychophysics to study visual perception. Psychophysics is the quantitative examination of the relationships between controlled stimuli and the behavioral responses they elicit in experimental test subjects. Instead of using summary statistics to gauge performance, psychophysics directs us to construct item-response curves made up of individual stimulus responses to find perceptual thresholds, thus allowing one to identify the exact point at which a subject can no longer reliably recognize the stimulus class. In this article, we introduce a comprehensive evaluation framework for visual recognition models that is underpinned by this methodology. Over millions of procedurally rendered 3D scenes and 2D images, we compare the performance of well-known convolutional neural networks. Our results bring into question recent claims of human-like performance, and provide a path forward for correcting newly surfaced algorithmic deficiencies.

引用

页码：2280 / 2286

页数：7

共 42 条

[31] Hierarchical models of object recognition in cortex [J].

Riesenhuber, M ;

Poggio, T .

NATURE NEUROSCIENCE, 1999, 2 (11) :1019-1025

[32] ImageNet Large Scale Visual Recognition Challenge [J].

Russakovsky, Olga ;

Deng, Jia ;

Su, Hao ;

Krause, Jonathan ;

Satheesh, Sanjeev ;

Ma, Sean ;

Huang, Zhiheng ;

Karpathy, Andrej ;

Khosla, Aditya ;

Bernstein, Michael ;

Berg, Alexander C. ;

Fei-Fei, Li .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 115 (03) :211-252

[33] Perceptual Annotation: Measuring Human Vision to Improve Computer Vision [J].

Scheirer, Walter J. ;

Anthony, Samuel E. ;

Nakayama, Ken ;

Cox, David D. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (08) :1679-1686

[34]

SEARLE JR, 1980, BEHAV BRAIN SCI, V3, P417, DOI 10.1017/S0140525X00006038

[35]

Szegedy C, 2015, PROC CVPR IEEE, P1, DOI 10.1109/CVPR.2015.7298594

[36] How to Grow a Mind: Statistics, Structure, and Abstraction [J].

Tenenbaum, Joshua B. ;

Kemp, Charles ;

Griffiths, Thomas L. ;

Goodman, Noah D. .

SCIENCE, 2011, 331 (6022) :1279-1285

[37] A Deeper Look at Dataset Bias [J].

Tommasi, Tatiana ;

Patricia, Novi ;

Caputo, Barbara ;

Tuytelaars, Tinne .

PATTERN RECOGNITION, GCPR 2015, 2015, 9358 :504-516

[38]

Torralba A, 2011, PROC CVPR IEEE, P1521, DOI 10.1109/CVPR.2011.5995347

[39]

Vondrick C., 2015, Advances in Neural Information Processing Systems, P289

[40]

Wu JJ, 2016, ADV NEUR IN, V29

← 1 2 3 4 5 →