Deep convolutional networks do not classify based on global object shape

被引:207
作者
Baker, Nicholas [1 ]
Lu, Hongjing [1 ]
Erlikhman, Gennady [1 ,2 ]
Kellman, Philip J. [1 ]
机构
[1] Univ Calif Los Angeles, Dept Psychol, Los Angeles, CA 90095 USA
[2] Univ Nevada, Reno, NV 89557 USA
基金
美国国家科学基金会;
关键词
NEURAL-NETWORKS; RECOGNITION; REPRESENTATION; GRADIENT; SURFACE; COLOR; SET;
D O I
10.1371/journal.pcbi.1006613
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Deep convolutional networks (DCNNs) are achieving previously unseen performance in object classification, raising questions about whether DCNNs operate similarly to human vision. In biological vision, shape is arguably the most important cue for recognition. We tested the role of shape information in DCNNs trained to recognize objects. In Experiment 1, we presented a trained DCNN with object silhouettes that preserved overall shape but were filled with surface texture taken from other objects. Shape cues appeared to play some role in the classification of artifacts, but little or none for animals. In Experiments 2-4, DCNNs showed no ability to classify glass figurines or outlines but correctly classified some silhouettes. Aspects of these results led us to hypothesize that DCNNs do not distinguish object's bounding contours from other edges, and that DCNNs access some local shape features, but not global shape. In Experiment 5, we tested this hypothesis with displays that preserved local features but disrupted global shape, and vice versa. With disrupted global shape, which reduced human accuracy to 28%, DCNNs gave the same classification labels as with ordinary shapes. Conversely, local contour changes eliminated accurate DCNN classification but caused no difficulty for human observers. These results provide evidence that DCNNs have access to some local shape information in the form of local edge relations, but they have no access to global object shapes.
引用
收藏
页数:43
相关论文
共 50 条
  • [31] Using Convolutional Neural Networks to Classify Audio Signal in Noisy Sound Scenes
    Gubin, M. V.
    2018 GLOBAL SMART INDUSTRY CONFERENCE (GLOSIC), 2018,
  • [32] Deep Convolutional Neural Networks for Image Classification: A Comprehensive Review
    Rawat, Waseem
    Wang, Zenghui
    NEURAL COMPUTATION, 2017, 29 (09) : 2352 - 2449
  • [33] DeepChart: Combining deep convolutional networks and deep belief networks in chart classification
    Tang, Binbin
    Liu, Xiao
    Lei, Jie
    Song, Mingli
    Tao, Dapeng
    Sun, Shuifa
    Dong, Fangmin
    SIGNAL PROCESSING, 2016, 124 : 156 - 161
  • [34] Deep Vein: Novel Finger Vein Verification Methods Based on Deep Convolutional Neural Networks
    Huang, Houjun
    Liu, Shilei
    Zheng, He
    Ni, Liao
    Zhang, Yi
    Li, Wenxin
    2017 IEEE INTERNATIONAL CONFERENCE ON IDENTITY, SECURITY AND BEHAVIOR ANALYSIS (ISBA), 2017,
  • [35] Object-Based Classification Framework of Remote Sensing Images With Graph Convolutional Networks
    Zhang, Xiaodong
    Tan, Xiaoliang
    Chen, Guanzhou
    Zhu, Kun
    Liao, Puyun
    Wang, Tong
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [36] Convolutional SVM Networks for Object Detection in UAV Imagery
    Bazi, Yakoub
    Melgani, Farid
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (06): : 3107 - 3118
  • [37] Object Shape and Orientation Do Not Routinely Influence Performance During Language Processing
    Rommers, Joost
    Meyer, Antje S.
    Huettig, Falk
    PSYCHOLOGICAL SCIENCE, 2013, 24 (11) : 2218 - 2225
  • [38] AMalNet: A deep learning framework based on graph convolutional networks for malware detection
    Pei, Xinjun
    Yu, Long
    Tian, Shengwei
    COMPUTERS & SECURITY, 2020, 93
  • [39] Automatic feature extraction and classification of Iberian ceramics based on deep convolutional networks
    Cintas, Celia
    Lucena, Manuel
    Manuel Fuertes, Jose
    Delrieux, Claudio
    Navarro, Pablo
    Gonzalez-Jose, Rolando
    Molinos, Manuel
    JOURNAL OF CULTURAL HERITAGE, 2020, 41 : 106 - 112
  • [40] Deep Convolutional Neural Networks for Large-scale Speech Tasks
    Sainath, Tara N.
    Kingsbury, Brian
    Saon, George
    Soltau, Hagen
    Mohamed, Abdel-rahman
    Dahl, George
    Ramabhadran, Bhuvana
    NEURAL NETWORKS, 2015, 64 : 39 - 48