A computational theory of visual receptive fields

被引：0

作者：

Tony Lindeberg

机构：

[1] KTH Royal Institute of Technology,Department of Computational Biology, School of Computer Science and Communication

来源：

Biological Cybernetics | 2013年 / 107卷

关键词：

Receptive field; Scale space; Gaussian derivative; Scale covariance ; Affine covariance; Galilean covariance; Illumination invariance; LGN; Primary visual cortex; Visual area V1; Functional model; Simple cell; Double-opponent cell; Complex cell; Vision; Theoretical neuroscience; Theoretical biology;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

A receptive field constitutes a region in the visual field where a visual cell or a visual operator responds to visual stimuli. This paper presents a theory for what types of receptive field profiles can be regarded as natural for an idealized vision system, given a set of structural requirements on the first stages of visual processing that reflect symmetry properties of the surrounding world. These symmetry properties include (i) covariance properties under scale changes, affine image deformations, and Galilean transformations of space–time as occur for real-world image data as well as specific requirements of (ii) temporal causality implying that the future cannot be accessed and (iii) a time-recursive updating mechanism of a limited temporal buffer of the past as is necessary for a genuine real-time system. Fundamental structural requirements are also imposed to ensure (iv) mutual consistency and a proper handling of internal representations at different spatial and temporal scales. It is shown how a set of families of idealized receptive field profiles can be derived by necessity regarding spatial, spatio-chromatic, and spatio-temporal receptive fields in terms of Gaussian kernels, Gaussian derivatives, or closely related operators. Such image filters have been successfully used as a basis for expressing a large number of visual operations in computer vision, regarding feature detection, feature classification, motion estimation, object recognition, spatio-temporal recognition, and shape estimation. Hence, the associated so-called scale-space theory constitutes a both theoretically well-founded and general framework for expressing visual operations. There are very close similarities between receptive field profiles predicted from this scale-space theory and receptive field profiles found by cell recordings in biological vision. Among the family of receptive field profiles derived by necessity from the assumptions, idealized models with very good qualitative agreement are obtained for (i) spatial on-center/off-surround and off-center/on-surround receptive fields in the fovea and the LGN, (ii) simple cells with spatial directional preference in V1, (iii) spatio-chromatic double-opponent neurons in V1, (iv) space–time separable spatio-temporal receptive fields in the LGN and V1, and (v) non-separable space–time tilted receptive fields in V1, all within the same unified theory. In addition, the paper presents a more general framework for relating and interpreting these receptive fields conceptually and possibly predicting new receptive field profiles as well as for pre-wiring covariance under scaling, affine, and Galilean transformations into the representations of visual stimuli. This paper describes the basic structure of the necessity results concerning receptive field profiles regarding the mathematical foundation of the theory and outlines how the proposed theory could be used in further studies and modelling of biological vision. It is also shown how receptive field responses can be interpreted physically, as the superposition of relative variations of surface structure and illumination variations, given a logarithmic brightness scale, and how receptive field measurements will be invariant under multiplicative illumination variations and exposure control mechanisms.

引用

页码：589 / 635

页数：46

共 229 条

[1] Adelson E(1985)Spatiotemporal energy models for the perception of motion J Opt Soc Am A2 284-299
[2] Bergen J(2000)Fingerprint enhancement by shape adaptation of scale-space operators with automatic scale-selection IEEE Trans Image Process 9 2027-2042
[3] Almansa A(1986)Uniqueness of the Gaussian kernel for scale-space filtering IEEE Trans Pattern Anal Mach Intell 8 3-26
[4] Lindeberg T(2006)‘Simplification’ of responses of complex cells in cat striate cortex; suppressive surrounds and ’feedback’ inactivation J Physiol 574 731-750
[5] Babaud J(2008)Speeded up robust features (SURF) Comput Vis Image Underst 110 346-359
[6] Witkin AP(1992)Orientation selectivity, preference and continuity in monkey striate cortex J Neurosci 12 3139-3161
[7] Baudin M(1991)Iso-orientation domains in cat visual cortex are arranged in pinwheel-like patterns Nature 353 429-431
[8] Duda RO(2005)The suppressive field of neurons in the lateral geniculate nucleus J Neurosci 25 10844-10856
[9] Bardy C(2009)Performance evaluation of local colour invariants Comput Vis Image Underst 113 48-62
[10] Huang JY(1981)Fast filter transforms for image processing Comput Vis Graph Image Process 16 20-51

← 1 2 3 4 5 6 7 8 9 10 →