A comprehensive coarse-to-fine sports video analysis framework to infer 3D parameters of video objects with application to tennis video sequences

被引：0

作者：

Luo, Y ^{[1
]}

Hwang, JN ^{[1
]}

机构：

[1] Univ Washington, Dept Elect Engn, IPL, Seattle, WA 98195 USA

来源：

2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING | 2005年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present a novel video content analysis system. An innovative 2D to 3D parameter inference algorithm is presented. It is applied to the tennis player body shape modeling, after a coarse-to-fine analysis on real world sports video sequences. As the first step, the video shots are classified in coarse level. Only shots containing appropriate body shape size are retained for the fine-level analysis. The fine-level analysis begins with a video object (VO) segmentation stage to obtain the player body shapes. The VOs then undergo training and testing stages. The training VOs are classified into serving and non-serving classes by Gaussian mixture modeling (GMM). The VOs in serving class are further clustered and the corresponding 3D parameters of a human body model are obtained manually for each cluster center. For a testing VO sequence, the VOs that contain servings are found by GMM and the initial 3D parameters are fitted to the closest matches to the cluster centers. Based on the initial guess, an innovative multidimensional optimization procedure is employed to obtain the 3D parameters. Experiments are performed on broadcasted tennis games and promising results are obtained.

引用

页码：425 / 428

页数：4

共 17 条

[1]

Cheng N, 2002, MOL CANCER RES, V1, P2

[2]

Goncharov AB, 1995, MATH RES LETT, V2, P95

[3]

Howe N, 1999, NIPS 99

[4]

KARLIGA I, 2004, IEEE MMSP 04

[5] Fast and automatic video object segmentation and tracking for content-based applications [J].

Kim, C ;

Hwang, JN .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2002, 12 (02) :122-129

[6] Video object extraction for object-oriented applications [J].

Kim, C ;

Hwang, JN .

JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2001, 29 (1-2) :7-21

[7]

KOUBAROULIS D, 2002, ICPR 02

[8] A silhouette-based algorithm for texture registration and stitching [J].

Lensch, HPA ;

Heidrich, W ;

Seidel, HP .

GRAPHICAL MODELS, 2001, 63 (04) :245-262

[9]

LIN T, 2000, ICPR 00

[10] FITTING PARAMETERIZED 3-DIMENSIONAL MODELS TO IMAGES [J].

LOWE, DG .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1991, 13 (05) :441-450

← 1 2 →