Global vs. local processing of compressed representations: A computational model of visual search

被引：1

作者：

Cohen, E ^{[1
]}

Levy, N

Ruppin, E

机构：

[1] Tel Aviv Univ, Dept Psychol, IL-69978 Tel Aviv, Israel

[2] Tel Aviv Univ, Sch Phys, IL-69978 Tel Aviv, Israel

[3] Tel Aviv Univ, Dept Comp Sci, IL-69978 Tel Aviv, Israel

[4] Tel Aviv Univ, Dept Physiol, IL-69978 Tel Aviv, Israel

来源：

NEUROCOMPUTING | 2000年 / 32卷

关键词：

attention; compression; natural-scenes; PCA; visual-search;

D O I：

10.1016/S0925-2312(00)00230-7

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A novel computational model of a pre-attentive system performing visual-search is presented. The model processes various types of displays, reproduced from three sources of visual-search experimental data: Duncan and Humphreys, Psychol. Rev. 96 (1989) 453-458, Treisman and Sate, J. Exp. Psychol. 16(1990) 459-478, and Wolfe, Friedman-Hill, Stewart, O'Connell, J. Exp. Psychol. 18 (1992) 34-49. The response-time-slopes measured in these experiments suggest that some of the displays are searched serially while others are scanned in parallel. Our model operates in two phases. First, the visual-search displays are compressed to overcome assumed biological capacity limitations. Compression is achieved by projecting the tasks' displays on a small set of feature maps. These features have been extracted from a large set of natural images by means of principal component analysis. Second, the compressed representations are further processed to identify a target in the display. The model succeeds in fast detection of targets in experimentally labeled parallel displays, but fails with serial ones. Analysis of the compressed representations reveals that compressed parallel displays contain global information that enables instantaneous target detection. However, in serial displays' representations, this global information is obscure and hence, a target detection system should resort to a serial, attentional scan of local features across the display. Our analysis provides a numerical criterion that is strongly correlated with the experimental response-time-slopes. It also provides new insight to the mechanisms of visual-attention, suggesting a self-organized representation of Treisman's feature maps, which may be implemented in other paradigms in the held. (C) 2000 Elsevier Science B.V. All rights reserved.

引用

页码：667 / 671

页数：5

共 7 条

[1] From parallel to serial processing: A computational study of visual search [J].