On Human-like Biases in Convolutional Neural Networks for the Perception of Slant from Texture

被引:0
作者
Wang, Yuanhao [1 ]
Zhang, Qian [1 ]
Aubuchon, Celine [2 ]
Kemp, Jovan [2 ]
Domini, Fulvio [2 ]
Tompkin, James [1 ]
机构
[1] Brown Univ, Dept Comp Sci, Providence, RI 02912 USA
[2] Brown Univ, Dept Cognit Linguist & Psychol Sci, Providence, RI 02912 USA
关键词
Perception; slant; texture; convolutional neural networks; deep learning; SHAPE; CUES;
D O I
10.1145/3613451
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Depth estimation is fundamental to 3D perception, and humans are known to have biased estimates of depth. This study investigates whether convolutional neural networks (CNNs) can be biased when predicting the sign of curvature and depth of surfaces of textured surfaces under different viewing conditions (field of view) and surface parameters (slant and texture irregularity). This hypothesis is drawn fromthe idea that texture gradients described by local neighborhoods-a cue identified in human vision literature-are also representable within convolutional neural networks. To this end, we trained both unsupervised and supervised CNN models on the renderings of slanted surfaces with random Polka dot patterns and analyzed their internal latent representations. The results show that the unsupervisedmodels have similar prediction biases as humans across all experiments, while supervised CNN models do not exhibit similar biases. The latent spaces of the unsupervised models can be linearly separated into axes representing field of view and optical slant. For supervised models, this ability varies substantially with model architecture and the kind of supervision (continuous slant vs. sign of slant). Even though this study says nothing of any shared mechanism, these findings suggest that unsupervised CNN models can share similar predictions to the human visual system. Code: github.com/brownvc/Slant-CNN-Biases
引用
收藏
页数:18
相关论文
共 27 条
[1]  
Islam MA, 2021, Arxiv, DOI arXiv:2101.11604
[2]   Explicit and implicit depth-cue integration: Evidence of systematic biases with real objects [J].
Campagnoli, Carlo ;
Hung, Bethany ;
Domini, Fulvio .
VISION RESEARCH, 2022, 190
[3]   3-D structure perceived from dynamic information: a new theory [J].
Domini, F ;
Caudek, C .
TRENDS IN COGNITIVE SCIENCES, 2003, 7 (10) :444-449
[4]  
Geirhos R, 2019, Arxiv, DOI [arXiv:1811.12231, DOI 10.48550/ARXIV.1811.12231]
[5]   THE PERCEPTION OF VISUAL SURFACES [J].
GIBSON, JJ .
AMERICAN JOURNAL OF PSYCHOLOGY, 1950, 63 (03) :367-384
[6]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[7]   SYSTEMATIC DISTORTIONS OF SHAPE FROM STEREOPSIS [J].
JOHNSTON, EB .
VISION RESEARCH, 1991, 31 (7-8) :1351-1360
[8]  
Kubilius J, 2019, ADV NEUR IN, V32
[9]   Are blur and disparity complementary cues to depth? [J].
Langer, Michael S. ;
Siciliano, Ryan A. .
VISION RESEARCH, 2015, 107 :15-21
[10]   Convolutional Neural Networks as a Model of the Visual System: Past, Present, and Future [J].
Lindsay, Grace W. .
JOURNAL OF COGNITIVE NEUROSCIENCE, 2021, 33 (10) :2017-2031