Compressed Domain Video Object Segmentation

Cited by: 28
Authors
Porikli, Fatih [1]
Bashir, Faisal [1]
Sun, Huifang [1]
Affiliations
[1] Mitsubishi Elect Res Labs, Cambridge, MA 02139 USA
Keywords
Compressed domain segmentation; mean-shift analysis; MPEG video; volume growing; SPATIOTEMPORAL SEGMENTATION; SEARCH ALGORITHM; MOTION;
DOI
10.1109/TCSVT.2009.2020253
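Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code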
0808 ; 0809 ;
Abstract
We present a compressed domain video object segmentation method for MPEG-encoded video sequences. At a fraction of the cost of raw domain analysis, compressed domain segmentation provides essential a priori information to many vision tasks, from surveillance to transcoding, that require fast processing of large volumes of data and do not need pixel-resolution boundary extraction. Our method generates accurate segmentation maps in block resolution at hierarchically varying object levels, which lets an application determine the most pertinent partition of the images. It exploits the block structure of the compressed video to minimize the amount of data to be processed. All the available motion flow within a group of pictures is projected onto a single layer, which also contains the frequency decomposition of the color pattern. Then, starting from the blocks where the spatial energy is small, it expands homogeneous regions while automatically adapting local similarity criteria. We also formulate an alternative solution that applies kernel-based clustering, in which separate spatial, transform, and motion kernels are used to establish the affinity. We show that both region expansion and mean shift produce results similar to those of the computationally expensive raw domain segmentation. Finally, a binary clustering iteratively merges the most similar regions to generate a hierarchical partition tree.
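The final step of the abstract, iteratively merging the most similar regions into a hierarchical partition tree, can be illustrated with a minimal sketch. The abstract does not specify the similarity measure or the region features, so the following are assumptions made only for illustration: each region is summarized by a mean feature vector, similarity is Euclidean distance between those means, a merged region takes the size-weighted mean of its children, and spatial adjacency is ignored. The function name `build_partition_tree` is hypothetical and does not come from the paper.

```python
from itertools import combinations


def build_partition_tree(features, sizes):
    """Greedy binary merging of regions into a partition tree.

    features: dict mapping region id -> mean feature vector (assumed descriptor)
    sizes:    dict mapping region id -> number of blocks in the region
    Returns a list of merges as (child_a, child_b, parent_id, distance).
    """
    feats = {r: list(v) for r, v in features.items()}
    size = dict(sizes)
    next_id = max(feats) + 1
    merges = []

    def sq_dist(pair):
        # Squared Euclidean distance between two region descriptors (assumed metric).
        a, b = pair
        return sum((x - y) ** 2 for x, y in zip(feats[a], feats[b]))

    while len(feats) > 1:
        # Select the most similar pair among the regions that still exist.
        a, b = min(combinations(sorted(feats), 2), key=sq_dist)
        dist = sq_dist((a, b)) ** 0.5

        # The parent descriptor is the size-weighted mean of its two children.
        n = size[a] + size[b]
        merged = [(size[a] * x + size[b] * y) / n
                  for x, y in zip(feats[a], feats[b])]

        for r in (a, b):
            del feats[r]
            del size[r]
        feats[next_id] = merged
        size[next_id] = n
        merges.append((a, b, next_id, dist))
        next_id += 1

    return merges
```

Reading the returned merge list from the end toward the start gives progressively finer partitions, which is one way an application could choose the object level it needs.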
Pages: 2-14
Page count: 13
Related papers
25 in total
[1]   Compressed domain video retrieval using object and global motion descriptors [J].
Babu, R. Venkatesh ;
Ramakrishnan, K. R. .
MULTIMEDIA TOOLS AND APPLICATIONS, 2007, 32 (01) :93-113
[2]   Video object segmentation: A compressed domain approach [J].
Babu, RV ;
Ramakrishnan, KR ;
Srinivasan, SH .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2004, 14 (04) :462-474
[3]   MOTION SEGMENTATION AND QUALITATIVE DYNAMIC SCENE ANALYSIS FROM AN IMAGE SEQUENCE [J].
BOUTHEMY, P ;
FRANCOIS, E .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 1993, 10 (02) :157-182
[4]   Mean shift: A robust approach toward feature space analysis [J].
Comaniciu, D ;
Meer, P .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (05) :603-619
[5]  
Comaniciu D., 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision, P1197, DOI 10.1109/ICCV.1999.790416
[6]  
DEQUEIROZ R, 2000, IEEE T IMAGE PROCESS
[7]   THE CROSS-SEARCH ALGORITHM FOR MOTION ESTIMATION [J].
GHANBARI, M .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1990, 38 (07) :950-953
[8]   DETERMINING OPTICAL-FLOW [J].
HORN, BKP ;
SCHUNCK, BG .
ARTIFICIAL INTELLIGENCE, 1981, 17 (1-3) :185-203
[9]  
Ji S, 1998, 1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 1, P80, DOI 10.1109/ICIP.1998.723425
[10]  
KOBLA V, 1996, CARTR839 CFAR