Deep convolutional networks do not classify based on global object shape

Cited by: 225

Authors:

Baker, Nicholas [1]

Lu, Hongjing [1]

Erlikhman, Gennady [1,2]

Kellman, Philip J. [1]

Affiliations:

[1] Univ Calif Los Angeles, Dept Psychol, Los Angeles, CA 90095 USA

[2] Univ Nevada, Reno, NV 89557 USA

Source:

PLOS COMPUTATIONAL BIOLOGY | 2018 / Volume 14 / Issue 12

Funding:

National Science Foundation (USA);

Keywords:

NEURAL-NETWORKS; RECOGNITION; REPRESENTATION; GRADIENT; SURFACE; COLOR; SET;

DOI:

10.1371/journal.pcbi.1006613

Chinese Library Classification number:

Q5 [Biochemistry];

Discipline classification code:

071010 ; 081704 ;

Abstract:

Deep convolutional networks (DCNNs) are achieving previously unseen performance in object classification, raising questions about whether DCNNs operate similarly to human vision. In biological vision, shape is arguably the most important cue for recognition. We tested the role of shape information in DCNNs trained to recognize objects. In Experiment 1, we presented a trained DCNN with object silhouettes that preserved overall shape but were filled with surface texture taken from other objects. Shape cues appeared to play some role in the classification of artifacts, but little or none for animals. In Experiments 2-4, DCNNs showed no ability to classify glass figurines or outlines but correctly classified some silhouettes. Aspects of these results led us to hypothesize that DCNNs do not distinguish objects' bounding contours from other edges, and that DCNNs access some local shape features, but not global shape. In Experiment 5, we tested this hypothesis with displays that preserved local features but disrupted global shape, and vice versa. With disrupted global shape, which reduced human accuracy to 28%, DCNNs gave the same classification labels as with ordinary shapes. Conversely, local contour changes eliminated accurate DCNN classification but caused no difficulty for human observers. These results provide evidence that DCNNs have access to some local shape information in the form of local edge relations, but they have no access to global object shapes.

References

Pages: 43

44 entries in total

[1] Nguyen A, 2015, PROC CVPR IEEE, P427, DOI 10.1109/CVPR.2015.7298640

[2] [Anonymous], 2016, PROC 9 EAI INT C BIO

[3] [Anonymous], 2017, P 38 C COGNITIVE SCI

[4] SOME INFORMATIONAL ASPECTS OF VISUAL PERCEPTION [J]. ATTNEAVE, F. PSYCHOLOGICAL REVIEW, 1954, 61 (03): 183-193

[5] Abstract Shape Representation in Human Visual Perception [J]. Baker, Nicholas; Kellman, Philip J. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 2018, 147 (09): 1295-1308

[6] GENERIC OBJECT RECOGNITION - BUILDING AND MATCHING COARSE DESCRIPTIONS FROM LINE DRAWINGS [J]. BERGEVIN, R; LEVINE, MD. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1993, 15 (01): 19-36

[7] Bethge M., 2014, International Conference on Learning Representations (ICLR 2015)

[8] RECOGNITION-BY-COMPONENTS - A THEORY OF HUMAN IMAGE UNDERSTANDING [J]. BIEDERMAN, I. PSYCHOLOGICAL REVIEW, 1987, 94 (02): 115-147

[9] SURFACE VERSUS EDGE-BASED DETERMINANTS OF VISUAL RECOGNITION [J]. BIEDERMAN, I; JU, G. COGNITIVE PSYCHOLOGY, 1988, 20 (01): 38-64

[10] Deep Neural Networks Rival the Representation of Primate IT Cortex for Core Visual Object Recognition [J]. Cadieu, Charles F.; Hong, Ha; Yamins, Daniel L. K.; Pinto, Nicolas; Ardila, Diego; Solomon, Ethan A.; Majaj, Najib J.; DiCarlo, James J. PLOS COMPUTATIONAL BIOLOGY, 2014, 10 (12)
