Motion analysis and segmentation through spatio-temporal slices processing

被引:87
作者
Ngo, CW [1 ]
Pong, TC
Zhang, HJ
机构
[1] City Univ Hong Kong, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
[2] Hong Kong Univ Sci & Technol, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
[3] Microsoft Res Asia, Beijing 100080, Peoples R China
关键词
motion segmentation; spatio-temporal slices; tensor histogram;
D O I
10.1109/TIP.2003.809020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents new approaches in characterizing and segmenting the content of video. These approaches are developed based upon the pattern analysis of spatio-temporal slices. While traditional approaches to motion sequence analysis tend to formulate computational methodologies on two or three adjacent frames, spatio-temporal slices provide rich visual patterns along a larger temporal scale. In this paper, we first describe a motion computation method based on a structure tensor formulation. This method encodes visual patterns of spatio-temporal slices in a tensor histogram, on one hand, characterizing the temporal changes of motion over time, on the other hand, describing the motion trajectories of different moving objects. By analyzing the tensor histogram of an image sequence, we can temporally segment the sequence into several motion coherent subunits, in addition, spatially segment the sequence into various motion layers. The temporal segmentation of image sequences expeditiously facilitates the motion annotation and content representation of a video, while the spatial decomposition of image sequences leads to a prominent way of reconstructing background panoramic images and computing foreground objects.
引用
收藏
页码:341 / 355
页数:15
相关论文
共 34 条
[1]   SPATIOTEMPORAL ENERGY MODELS FOR THE PERCEPTION OF MOTION [J].
ADELSON, EH ;
BERGEN, JR .
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1985, 2 (02) :284-299
[2]  
AKUTSU A, 1995, P INT C IMAGE PROCES, V1, P330
[3]  
[Anonymous], 1993, THESIS MIT
[4]  
AYER S, 1994, EUR C COMP VIS
[5]   EPIPOLAR-PLANE IMAGE-ANALYSIS - AN APPROACH TO DETERMINING STRUCTURE FROM MOTION [J].
BOLLES, RC ;
BAKER, HH ;
MARIMONT, DH .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 1987, 1 (01) :7-55
[6]   A unified approach to shot change detection and camera motion characterization [J].
Bouthemy, P ;
Gelgon, M ;
Ganansia, F .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1999, 9 (07) :1030-1044
[7]   COOPERATIVE ROBUST ESTIMATION USING LAYERS OF SUPPORT [J].
DARRELL, T ;
PENTLAND, AP .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1995, 17 (05) :474-487
[8]  
Granlund G, 1995, SIGNAL PROCESSING CO
[9]   Spatial color indexing and applications [J].
Huang, J ;
Kumar, SR ;
Mitra, M ;
Zhu, WJ ;
Zabih, R .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 1999, 35 (03) :245-268
[10]   COMPUTING OCCLUDING AND TRANSPARENT MOTIONS [J].
IRANI, M ;
ROUSSO, B ;
PELEG, S .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 1994, 12 (01) :5-16