A novel moving object segmentation framework utilizing camera motion recognition for H.264 compressed videos

被引:4
作者
Okade, Manish [1 ]
Biswas, Prabir Kumar [2 ]
机构
[1] Natl Inst Technol, Dept Elect & Commun Engn, Rourkela 769008, India
[2] Indian Inst Technol, Dept Elect & Elect Commun Engn, Kharagpur 721302, W Bengal, India
关键词
Block motion vectors; Camera motion; Moving object segmentation; Wavelet subbands; Neural network classifier; Graph cut; Compressed domain; Back-propagation training; SPATIOTEMPORAL SEGMENTATION; ENERGY MINIMIZATION; IMAGE SEGMENTATION; TRACKING; SEQUENCES; FIELD;
D O I
10.1016/j.jvcir.2016.01.016
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel coarse to fine moving object segmentation framework for H.264/AVC compressed videos. The proposed framework integrates the global motion estimation and global motion compensation steps in the segmentation pipeline unlike previous techniques which did not consider such an integration. The integration is based on testing for presence of global motion by classifying the interframe motion vectors into moving camera class and still camera class. The decision boundary separating these two classes is learnt from the training video data. The integration automates the moving object segmentation to be applicable for static, moving and combination of static/moving camera cases which to the best of our knowledge has not been carried out earlier. Further, a novel coarse segmentation technique is proposed by decomposing the inter-frame motion vectors into wavelet sub-bands and utilizing logical operations on LH, HL and HH sub-band wavelet coefficients. The premise is based on the fact that since the LH, HL and HH sub-bands contain the detail information pertaining to horizontal, vertical and diagonal moving blocks respectively, they can be exploited to identify the coarse moving boundaries. The coarse segmentation is fast in comparison to state-of-the-art coarse segmentation methods as demonstrated by our experiments. Finally, these coarse boundaries are modeled in an energy minimization framework and shown that by minimizing the energy using graph cut optimization the segmentation is refined to obtain the fine segmentation. The proposed framework is tested on a number of standard video sequences encoded with H.264/AVC JM encoder and comparison is carried out with state-of-the-art compressed domain moving object segmentation methods as well as with an existing state-of-the-art pixel domain method to establish and validate the proposed moving object segmentation framework. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:199 / 212
页数:14
相关论文
共 32 条
  • [11] CLOUTIER L, 1994, IEEE IMAGE PROC, P805, DOI 10.1109/ICIP.1994.413682
  • [12] Robust Background Subtraction for Network Surveillance in H.264 Streaming Video
    Dey, Bhaskar
    Kundu, Malay K.
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2013, 23 (10) : 1695 - 1703
  • [13] Mean shift clustering-based moving object segmentation in the H.264 compressed domain
    Fei, W.
    Zhu, S.
    [J]. IET IMAGE PROCESSING, 2010, 4 (01) : 11 - 18
  • [14] Video Object Tracking in the Compressed Domain Using Spatio-Temporal Markov Random Fields
    Khatoonabadi, Sayed Hossein
    Bajic, Ivan V.
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (01) : 300 - 313
  • [15] What energy functions can be minimized via graph cuts?
    Kolmogorov, V
    Zabih, R
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (02) : 147 - 159
  • [16] Kowdle A, 2012, LECT NOTES COMPUT SC, V7574, P272, DOI 10.1007/978-3-642-33712-3_20
  • [17] Li F., 2013, ICCV
  • [18] Macroblock Classification Method for Video Applications Involving Motions
    Lin, Weiyao
    Sun, Ming-Ting
    Li, Hongxiang
    Chen, Zhenzhong
    Li, Wei
    Zhou, Bing
    [J]. IEEE TRANSACTIONS ON BROADCASTING, 2012, 58 (01) : 34 - 46
  • [19] Real-time spatiotemporal segmentation of video objects in the H.264 compressed domain
    Liu, Zhi
    Lu, Yu
    Zhang, Zhaoyang
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2007, 18 (03) : 275 - 290
  • [20] Okade M., 2015, IEEE T CIRC SYST VID