Deep convolutional networks do not classify based on global object shape

Cited by: 207

Authors:

Baker, Nicholas [1]

Lu, Hongjing [1]

Erlikhman, Gennady [1,2]

Kellman, Philip J. [1]

Affiliations:

[1] Univ Calif Los Angeles, Dept Psychol, Los Angeles, CA 90095 USA

[2] Univ Nevada, Reno, NV 89557 USA

Source:

PLOS COMPUTATIONAL BIOLOGY | 2018 / Vol. 14 / Issue 12

Funding:

National Science Foundation (USA);

Keywords:

NEURAL-NETWORKS; RECOGNITION; REPRESENTATION; GRADIENT; SURFACE; COLOR; SET;

DOI:

10.1371/journal.pcbi.1006613

Chinese Library Classification (CLC) number:

Q5 [Biochemistry];

Discipline classification codes:

071010 ; 081704 ;

Abstract:

Deep convolutional networks (DCNNs) are achieving previously unseen performance in object classification, raising questions about whether DCNNs operate similarly to human vision. In biological vision, shape is arguably the most important cue for recognition. We tested the role of shape information in DCNNs trained to recognize objects. In Experiment 1, we presented a trained DCNN with object silhouettes that preserved overall shape but were filled with surface texture taken from other objects. Shape cues appeared to play some role in the classification of artifacts, but little or none for animals. In Experiments 2-4, DCNNs showed no ability to classify glass figurines or outlines but correctly classified some silhouettes. Aspects of these results led us to hypothesize that DCNNs do not distinguish objects' bounding contours from other edges, and that DCNNs access some local shape features, but not global shape. In Experiment 5, we tested this hypothesis with displays that preserved local features but disrupted global shape, and vice versa. With disrupted global shape, which reduced human accuracy to 28%, DCNNs gave the same classification labels as with ordinary shapes. Conversely, local contour changes eliminated accurate DCNN classification but caused no difficulty for human observers. These results provide evidence that DCNNs have access to some local shape information in the form of local edge relations, but they have no access to global object shapes.


Pages: 43
