A neural model of visual figure-ground segregation from kinetic occlusion

被引:12
作者
Barnes, Timothy [1 ]
Mingolla, Ennio [2 ]
机构
[1] Boston Univ, Ctr Computat Neurosci & Neural Technol CompNet, Boston, MA 02215 USA
[2] Northeastern Univ, Dept Speech Language Pathol & Audiol, Boston, MA 02115 USA
基金
美国国家科学基金会;
关键词
Visual cortex; Visual motion; Accretion and deletion; Figure-ground; Occlusion; Neural model; CORTICAL DYNAMICS; BRIGHTNESS PERCEPTION; RECEPTIVE-FIELDS; MOTION PARALLAX; OBJECT SEGMENTATION; 3-DIMENSIONAL FORM; DEFINED BOUNDARIES; DEPTH-ORDER; AREA MT; CORTEX;
D O I
10.1016/j.neunet.2012.09.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Freezing is an effective defense strategy for some prey, because their predators rely on visual motion to distinguish objects from their surroundings. An object moving over a background progressively covers (deletes) and uncovers (accretes) background texture while simultaneously producing discontinuities in the optic flow field. These events unambiguously specify kinetic occlusion and can produce a crisp edge, depth perception, and figure-ground segmentation between identically textured surfaces percepts which all disappear without motion. Given two abutting regions of uniform random texture with different motion velocities, one region appears to be situated farther away and behind the other (i.e., the ground) if its texture is accreted or deleted at the boundary between the regions, irrespective of region and boundary velocities. Consequently, a region with moving texture appears farther away than a stationary region if the boundary is stationary, but it appears closer (i.e., the figure) if the boundary is moving coherently with the moving texture. A computational model of visual areas V1 and V2 shows how interactions between orientation- and direction-selective cells first create a motion-defined boundary and then signal kinetic occlusion at that boundary. Activation of model occlusion detectors tuned to a particular velocity results in the model assigning the adjacent surface with a matching velocity to the far depth. A weak speed-depth bias brings faster-moving texture regions forward in depth in the absence of occlusion (shearing motion). These processes together reproduce human psychophysical reports of depth ordering for key cases of kinetic occlusion displays. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:141 / 162
页数:22
相关论文
共 66 条
[1]   SPATIOTEMPORAL ENERGY MODELS FOR THE PERCEPTION OF MOTION [J].
ADELSON, EH ;
BERGEN, JR .
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1985, 2 (02) :284-299
[2]   Self-motion-induced eye movements: Effects on visual acuity and navigation [J].
Angelaki, DE ;
Hess, BJM .
NATURE REVIEWS NEUROSCIENCE, 2005, 6 (12) :966-976
[3]   MECHANISM OF DIRECTIONALLY SELECTIVE UNITS IN RABBITS RETINA [J].
BARLOW, HB ;
LEVICK, WR .
JOURNAL OF PHYSIOLOGY-LONDON, 1965, 178 (03) :477-&
[4]  
Barnes T., 2012, J COMPUTATIONAL NEUR
[5]   Figure-ground segregation at contours: A neural mechanism in the visual cortex of the alert monkey [J].
Baumann, R ;
vanderZwan, R ;
Peterhans, E .
EUROPEAN JOURNAL OF NEUROSCIENCE, 1997, 9 (06) :1290-1303
[6]   Disambiguating visual motion by form-motion interaction - a computational model [J].
Bayerl, Pierre ;
Neumann, Heiko .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2007, 72 (01) :27-45
[7]   Object Segmentation from Motion Discontinuities and Temporal Occlusions-A Biologically Inspired Model [J].
Beck, Cornelia ;
Ognibeni, Thilo ;
Neumann, Heiko .
PLOS ONE, 2008, 3 (11)
[8]   Laminar cortical dynamics of visual form and motion interactions during coherent object motion perception [J].
Berzhanskaya, J. ;
Grossberg, S. ;
Mingolla, E. .
SPATIAL VISION, 2007, 20 (04) :337-395
[9]   Structure and function of visual area MT [J].
Born, RT ;
Bradley, DC .
ANNUAL REVIEW OF NEUROSCIENCE, 2005, 28 :157-189
[10]   Cortical dynamics of navigation and steering in natural scenes: Motion-based object segmentation, heading, and obstacle avoidance [J].
Browning, N. Andrew ;
Grossberg, Stephen ;
Mingolla, Ennio .
NEURAL NETWORKS, 2009, 22 (10) :1383-1398