A summary-statistic representation in peripheral vision explains visual crowding

Cited by: 246
Authors
Balas, Benjamin [1]
Nakano, Lisa [2]
Rosenholtz, Ruth [2]
Affiliations
[1] Childrens Hosp, Labs Cognit Neurosci, Boston, MA 02115 USA
[2] MIT, Dept Brain & Cognit Sci, Cambridge, MA 02139 USA
Keywords
peripheral vision; crowding; texture perception; texture synthesis; computational model; LATERAL MASKING; TEXTURE; SIMILARITY; RECOGNITION; MODEL; IDENTIFICATION; SENSITIVITY; ORIENTATION; PERCEPTION; RESOLUTION;
DOI
10.1167/9.12.13
Chinese Library Classification number
R77 [Ophthalmology]
Discipline classification code
100212
Abstract
Peripheral vision provides a less faithful representation of the visual input than foveal vision. Nonetheless, we can gain a lot of information about the world from our peripheral vision, for example in order to plan eye movements. The phenomenon of crowding shows that the reduction of information available in the periphery is not merely the result of reduced resolution. Crowding refers to visual phenomena in which identification of a target stimulus is significantly impaired by the presence of nearby stimuli, or flankers. What information is available in the periphery? We propose that the visual system locally represents peripheral stimuli by the joint statistics of responses of cells sensitive to different position, phase, orientation, and scale. This "textural" representation by summary statistics predicts the subjective "jumble" of features often associated with crowding. We show that the difficulty of performing an identification task within a single pooling region using this representation of the stimuli is correlated with peripheral identification performance under conditions of crowding. Furthermore, for a simple stimulus with no flankers, this representation can be adequate to specify the stimulus with some position invariance. This provides evidence that a unified neuronal mechanism may underlie peripheral vision, ordinary pattern recognition in central vision, and texture perception. A key component of our methodology involves creating visualizations of the information available in the summary statistics of a stimulus. We call these visualizations "mongrels" and show that they are highly useful in examining how the early visual system represents the visual input. Mongrels enable one to study the "equivalence classes" of our model, i.e., the sets of stimuli that map to the same representation according to the model.
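The idea of pooled summary statistics and their "equivalence classes" can be illustrated with a toy sketch. This is not the authors' model: the abstract does not specify the filters or the statistics used, so everything below is an assumption made for illustration. The circular finite differences stand in for oriented, multi-scale cells, the whole patch acts as a single pooling region, and the function names (`shift`, `filter_responses`, `summary_statistics`, `same_statistics`) are hypothetical.

```python
import random
from itertools import combinations_with_replacement

def shift(patch, dy, dx):
    """Circularly shift a 2-D list-of-lists patch by (dy, dx)."""
    h, w = len(patch), len(patch[0])
    return [[patch[(y - dy) % h][(x - dx) % w] for x in range(w)]
            for y in range(h)]

def filter_responses(patch):
    """Assumed stand-in for oriented, multi-scale cells: circular differences."""
    h, w = len(patch), len(patch[0])
    channels = []
    for step in (1, 2):                          # two scales (assumed)
        for dy, dx in ((step, 0), (0, step)):    # two orientations (assumed)
            channels.append([patch[(y + dy) % h][(x + dx) % w] - patch[y][x]
                             for y in range(h) for x in range(w)])
    return channels

def summary_statistics(patch):
    """Pool over the whole patch: pixel mean and variance, plus the
    variances and covariances of the filter channels (joint statistics)."""
    pixels = [v for row in patch for v in row]
    n = len(pixels)
    mean = sum(pixels) / n
    stats = [mean, sum((v - mean) ** 2 for v in pixels) / n]
    channels = filter_responses(patch)
    means = [sum(c) / n for c in channels]
    for i, j in combinations_with_replacement(range(len(channels)), 2):
        stats.append(sum((a - means[i]) * (b - means[j])
                         for a, b in zip(channels[i], channels[j])) / n)
    return stats

def same_statistics(p, q, tol=1e-9):
    """True if two patches map to the same pooled representation."""
    return all(abs(a - b) <= tol
               for a, b in zip(summary_statistics(p), summary_statistics(q)))

random.seed(0)
patch = [[random.random() for _ in range(16)] for _ in range(16)]
other = [[random.random() for _ in range(16)] for _ in range(16)]

# A shifted copy falls in the same equivalence class (position is discarded).
print(same_statistics(patch, shift(patch, 3, 5)))
# An unrelated random patch has different statistics.
print(same_statistics(patch, other))
```

In this sketch, pooling discards where each response occurred, so a shifted copy of the patch is indistinguishable from the original. That loosely mirrors the position invariance the abstract mentions, while patches with different structure remain distinguishable.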
Pages: 18