Understanding Tools: Task-Oriented Object Modeling, Learning and Recognition

被引：0

作者：

Zhu, Yixin ^{[1
]}

Zhao, Yibiao ^{[1
]}

Zhu, Song-Chun ^{[1
]}

机构：

[1] Univ Calif Los Angeles, Ctr Vis Cognit Learning & Art, Los Angeles, CA 90095 USA

来源：

2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2015年

关键词：

AFFORDANCES; GEOMETRY;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present a new framework - task-oriented modeling, learning and recognition which aims at understanding the underlying functions, physics and causality in using objects as "tools". Given a task, such as, cracking a nut or painting a wall, we represent each object, e.g. a hammer or brush, in a generative spatio-temporal representation consisting of four components: i) an affordance basis to be grasped by hand; ii) a functional basis to act on a target object (the nut), iii) the imagined actions with typical motion trajectories; and iv) the underlying physical concepts, e.g. force, pressure, etc. In a learning phase, our algorithm observes only one RGB-D video, in which a rational human picks up one object (i. e. tool) among a number of candidates to accomplish the task. From this example, our algorithm learns the essential physical concepts in the task (e.g. forces in cracking nuts). In an inference phase, our algorithm is given a new set of objects (daily objects or stones), and picks the best choice available together with the inferred affordance basis, functional basis, imagined human actions (sequence of poses), and the expected physical quantity that it will produce. From this new perspective, any objects can be viewed as a hammer or a shovel, and object recognition is not merely memorizing typical appearance examples for each category but reasoning the physical mechanisms in various tasks to achieve generalization.

引用

页码：2855 / 2864

页数：10

共 52 条

[1]

[Anonymous], 2003, COGNITION TOOL USE F

[2]

[Anonymous], 2002, P ACM SIGKDD KDD 200, DOI 10.1145/775047.775067

[3]

[Anonymous], 2011, Animal tool behavior: the use and manufacture of tools by animals, DOI DOI 10.1353/BOOK.98237

[4]

[Anonymous], 2011, BMVC

[5] A survey of robot learning from demonstration [J].

Argall, Brenna D. ;

Chernova, Sonia ;

Veloso, Manuela ;

Browning, Brett .

ROBOTICS AND AUTONOMOUS SYSTEMS, 2009, 57 (05) :469-483

[6] Simulation as an engine of physical scene understanding [J].

Battaglia, Peter W. ;

Hamrick, Jessica B. ;

Tenenbaum, Joshua B. .

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2013, 110 (45) :18327-18332

[7]

Beck BB, 1980, Animal tool behavior: The use and manufacture of tools by animals

[8]

Byrne R., 1989, MACHIAVELLIAN INTELL

[9] Understanding Indoor Scenes using 3D Geometric Phrases [J].

Choi, Wongun ;

Chao, Yu-Wei ;

Pantofaru, Caroline ;

Savarese, Silvio .

2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :33-40

[10] Neural representations of graspable objects: are tools special? [J].

Creem-Regehr, SH ;

Lee, JN .

COGNITIVE BRAIN RESEARCH, 2005, 22 (03) :457-469

← 1 2 3 4 5 6 →