In this paper, a procedure is described for the segmentation, content-based coding, and visualization of videoconference image sequences. First, image sequence analysis is used to estimate the shape and motion parameters of the person facing the camera. A spatiotemporal filter that takes into account the intensity differences between consecutive frames is applied in order to separate the moving person from the static background. The foreground is then segmented into a number of regions in order to identify the face. For this purpose, we propose the novel K-Means with connectivity constraint algorithm, a general segmentation algorithm combining several types of information, including intensity, motion, and compactness. In this algorithm, the use of spatiotemporal regions is introduced: a number of frames are analyzed simultaneously, so that the same region is present in consecutive frames. Based on this information, a 3-D ellipsoid is fitted to the person's face using an efficient and robust algorithm. The rigid 3-D motion is then estimated using a least median of squares approach. Finally, a Virtual Reality Modeling Language (VRML) file is created containing all of the above information; this file may be viewed with any VRML 2.0-compliant browser.
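The frame-difference filtering step can be illustrated with a minimal sketch: a pixel is marked as foreground when its intensity changes significantly between consecutive frames. This is not the paper's exact filter; the function name, threshold, and count parameter are assumptions for illustration.

```python
import numpy as np

def foreground_mask(frames, threshold=15, min_count=1):
    """Hypothetical sketch of a spatiotemporal change filter: a pixel is
    marked foreground if its intensity differs by more than `threshold`
    between at least `min_count` pairs of consecutive frames."""
    frames = np.asarray(frames, dtype=np.int32)
    # Absolute intensity differences between consecutive frames.
    diffs = np.abs(np.diff(frames, axis=0))
    # Count, per pixel, how many frame-to-frame changes were significant.
    change_count = (diffs > threshold).sum(axis=0)
    return change_count >= min_count
```

Applied to a sequence with a static background and a moving bright block, the returned Boolean mask covers the positions the block occupied.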
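The idea of combining intensity, motion, and compactness in one clustering can be sketched as plain k-means over per-pixel feature vectors, where the pixel coordinates act as a soft compactness term. This is an illustrative simplification, not the authors' exact K-Means with connectivity constraint; all parameter names, the spatial weighting, and the deterministic initialization are assumptions.

```python
import numpy as np

def kmcc_segment(intensity, motion, k=2, spatial_weight=0.5, iters=20):
    """Toy segmentation: k-means on [intensity, motion, weighted y, weighted x]
    features; the coordinate terms bias clusters toward compact regions."""
    h, w = intensity.shape
    ys, xs = np.mgrid[0:h, 0:w]
    feats = np.stack([intensity.ravel(),
                      motion.ravel(),
                      spatial_weight * ys.ravel(),
                      spatial_weight * xs.ravel()], axis=1).astype(float)
    # Deterministic initialization: k centers spread across the raster order.
    idx = np.linspace(0, len(feats) - 1, k).astype(int)
    centers = feats[idx].copy()
    for _ in range(iters):
        # Assign each pixel to the nearest center in feature space.
        d = ((feats[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d.argmin(axis=1)
        # Move each center to the mean of its assigned pixels.
        for j in range(k):
            if np.any(labels == j):
                centers[j] = feats[labels == j].mean(axis=0)
    return labels.reshape(h, w)
```

On a synthetic frame whose bright, moving left half contrasts with a dark, static right half, the two halves fall into different clusters.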
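The least median of squares principle used for the rigid 3-D motion estimate can be shown on a toy 1-D line-fitting problem: repeatedly fit a model to a minimal random sample and keep the fit whose *median* squared residual is smallest, which tolerates up to roughly half the data being outliers. The function name and trial count below are assumptions; the paper applies the same principle to rigid 3-D motion, not to lines.

```python
import numpy as np

def lmeds_line(x, y, trials=200, seed=0):
    """Least-median-of-squares line fit: sample 2-point minimal subsets
    and keep the line minimizing the median squared residual."""
    rng = np.random.default_rng(seed)
    best, best_med = None, np.inf
    for _ in range(trials):
        i, j = rng.choice(len(x), 2, replace=False)
        if x[i] == x[j]:
            continue  # vertical sample; skip it
        a = (y[j] - y[i]) / (x[j] - x[i])        # slope from the 2-point sample
        b = y[i] - a * x[i]                      # intercept
        med = np.median((y - (a * x + b)) ** 2)  # median squared residual
        if med < best_med:
            best, best_med = (a, b), med
    return best
```

With eight inliers on y = 2x + 1 and two gross outliers, an ordinary least-squares fit would be pulled far off, while the median criterion recovers the true line.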