EARLY PROCESSING OF VISUAL INFORMATION

Cited by: 439
Authors
MARR, D [1]
Affiliation
[1] MIT, ARTIFICIAL INTELLIG LAB, CAMBRIDGE, MA 02139 USA
Keywords
DOI
10.1098/rstb.1976.0090
Chinese Library Classification number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
The first step of consequence in human visual information processing is to compute a primitive but rich description of the grey-level changes present in an image. The description is expressed in a vocabulary of kinds of intensity change (edge, shading-edge, extended-edge, line, blob, etc.). Modifying parameters are bound to the elements in the description, specifying their position, orientation, termination points, contrast, size and fuzziness. This description is obtained from the intensity array by fixed techniques, and it is called the primal sketch. For most images, the primal sketch is large and unwieldy. The second important step in visual information processing is to group its contents in a way that is appropriate for later recognition. The necessary grouping of elements in the primal sketch may be achieved by a mechanism that has available the processes inferred from above, together with the ability to select items by first-order discriminations acting on the elements' parameters. Only occasionally do these mechanisms use downward-flowing information about the contents of the particular image being processed. Non-attentive vision is in practice implemented by these grouping operations and first-order discriminations acting on the primal sketch. The class of computations so obtained differs slightly from the class of second-order operations on the intensity array. The extraction of a form from the primal sketch using these techniques amounts to the separation of figure from ground. Most of the separation can be carried out by using techniques that do not depend upon the particular image in question. Figure-ground separation can normally precede the description of the shape of the extracted form.
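Illustrative sketch (not from the paper): the two steps named in the abstract can be pictured roughly as below. This is a toy built on assumptions, not Marr's actual procedure. It uses central differences where the paper uses its own fixed techniques, it records only position, orientation and contrast, and the names `primal_sketch`, `select` and `min_contrast` are hypothetical. Orientation here is the direction of the intensity gradient, so a vertical step edge is reported as 0 degrees.

```python
import math


def primal_sketch(image, min_contrast=1.0):
    """Describe grey-level changes as elements with bound parameters.

    Assumption: a central-difference gradient stands in for the
    paper's fixed techniques; only three parameters are recorded.
    """
    elements = []
    rows, cols = len(image), len(image[0])
    for y in range(1, rows - 1):
        for x in range(1, cols - 1):
            gx = (image[y][x + 1] - image[y][x - 1]) / 2.0
            gy = (image[y + 1][x] - image[y - 1][x]) / 2.0
            contrast = math.hypot(gx, gy)
            if contrast >= min_contrast:
                # Gradient direction folded into [0, 180) degrees.
                orientation = math.degrees(math.atan2(gy, gx)) % 180.0
                elements.append(
                    {"pos": (x, y), "orientation": orientation, "contrast": contrast}
                )
    return elements


def select(elements, orientation, tolerance=10.0):
    """First-order discrimination: pick elements by one parameter's value."""

    def angular_difference(a, b):
        d = abs(a - b) % 180.0
        return min(d, 180.0 - d)

    return [
        e for e in elements
        if angular_difference(e["orientation"], orientation) <= tolerance
    ]


# A 5x6 image with a vertical step edge between columns 2 and 3.
image = [[0, 0, 0, 10, 10, 10] for _ in range(5)]
sketch = primal_sketch(image)
group = select(sketch, orientation=0.0)
```

In this toy case every element lies on the step edge, so selecting on orientation alone returns the whole sketch as one group, while selecting a perpendicular orientation returns nothing. No knowledge of the particular image is used in either step.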
Pages: 483 / +
Page count: 1
Related papers
35 in total
[1]  
[Anonymous], 1969, Perceptrons
[2]  
[Anonymous], 1975, The psychology of computer vision
[3]  
[Anonymous], 1965, Mach Bands: Quantitative Studies on Neural Networks in the Retina
[4]  
BAJCSY R, 1972, AI180 STANF LAB MEM
[5]   ACTION POTENTIALS FROM THE FROG'S RETINA [J].
BARLOW, HB .
JOURNAL OF PHYSIOLOGY-LONDON, 1953, 119 (01) :58-68
[6]  
Brodatz P., 1966, TEXTURES PHOTOGRAPHI
[7]  
FREUDER EC, 1974, MITAI345 LAB TECHN R
[8]   MASKING IN VISUAL RECOGNITION - EFFECTS OF 2-DIMENSIONAL FILTERED NOISE [J].
HARMON, LD ;
JULESZ, B .
SCIENCE, 1973, 180 (4091) :1194-1197
[9]  
HERSKOVITS A, 1970, MITAI183 LAB MEM
[10]  
HORN BKP, 1973, MITAI285 LAB MEM