A bio-inspired method and system for visual object-based attention and segmentation

被引：6

作者：

Huber, David J. ^{[1
]}

Khosla, Deepak ^{[1
]}

机构：

[1] HRL Labs LLC, Malibu, CA USA

来源：

AUTOMATIC TARGET RECOGNITION XX; ACQUISITION, TRACKING, POINTING, AND LASER SYSTEMS TECHNOLOGIES XXIV; AND OPTICAL PATTERN RECOGNITION XXI | 2010年 / 7696卷

关键词：

attention; segmentation; bio-inspired; vision; feature-based saliency; object-based saliency; object recognition; MECHANISM; SEARCH; CORTEX;

D O I：

10.1117/12.850757

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

This paper describes a method and system of human-like attention and object segmentation in visual scenes that (1) attends to regions in a scene in their rank of saliency in the image, (2) extracts the boundary of an attended proto-object based on feature contours, and (3) can be biased to boost the attention paid to specific features in a scene, such as those of a desired target object in static and video imagery. The purpose of the system is to identify regions of a scene of potential importance and extract the region data for processing by an object recognition and classification algorithm. The attention process can be performed in a default, bottom-up manner or a directed, top-down manner which will assign a preference to certain features over others. One can apply this system to any static scene, whether that is a still photograph or imagery captured from video. We employ algorithms that are motivated by findings in neuroscience, psychology, and cognitive science to construct a system that is novel in its modular and stepwise approach to the problems of attention and region extraction, its application of a flooding algorithm to break apart an image into smaller proto-objects based on feature density, and its ability to join smaller regions of similar features into larger proto-objects. This approach allows many complicated operations to be carried out by the system in a very short time, approaching real-time. A researcher can use this system as a robust front-end to a larger system that includes object recognition and scene understanding modules; it is engineered to function over a broad range of situations and can be applied to any scene with minimal tuning from the user.

引用

页数：11

共 14 条

[1]

[Anonymous], 1996, CUCS00696 DEP COMP S

[2]

[Anonymous], 2006, P IEEE CS C COMP VIS

[3] COHERENT OSCILLATIONS - A MECHANISM OF FEATURE LINKING IN THE VISUAL-CORTEX - MULTIPLE ELECTRODE AND CORRELATION ANALYSES IN THE CAT [J].