Joint segmentation of collectively moving objects using a bag-of-words model and level set evolution

被引:14
作者
Wu, Si [1 ]
Wong, Hau San [1 ,2 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Ctr Innovat Applicat Internet & Multimedia Techno, Kowloon, Hong Kong, Peoples R China
关键词
Collective motion; Segmentation; Bag-of-words; Level set; MOTION SEGMENTATION; ACTIVE CONTOURS; COMPETITION;
D O I
10.1016/j.patcog.2012.03.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In scenes with collectively moving objects, to disregard the individual objects and take the entire group into consideration for motion characterization is a promising approach with wide application prospects. In contrast to studies on the segmentation of independently moving objects, our purpose is to construct a segmentation of these objects to characterize their motions at a macroscopic level. In general, the collectively moving objects in a group have very similar motion behavior with their neighbors and appear as a kind of global collective motion. This paper presents a joint segmentation approach for these collectively moving objects. In our model, we extract these macroscopic movement patterns based on optical flow field sequences. Specifically, a group of collectively moving objects correspond to a region where the optical flow field has high magnitude and high local direction coherence. As a result, our problem can be addressed by identifying these coherent optical flow field regions. The segmentation is performed through the minimization of a variational energy functional derived from the Bayes classification rule. Specifically, we use a bag-of-words model to generate a codebook as a collection of prototypical optical flow patterns, and the class-conditional probability density functions for different regions are determined based on these patterns. Finally, the minimization of our proposed energy functional results in the gradient descent evolution of segmentation boundaries which are implicitly represented through level sets. The application of our proposed approach is to segment and track multiple groups of collectively moving objects in a large variety of real-world scenes. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3389 / 3401
页数:13
相关论文
共 43 条
[31]   Definition and properties of Lagrangian coherent structures from finite-time Lyapunov exponents in two-dimensional aperiodic flows [J].
Shadden, SC ;
Lekien, F ;
Marsden, JE .
PHYSICA D-NONLINEAR PHENOMENA, 2005, 212 (3-4) :271-304
[32]   A MAP approach for joint motion estimation, segmentation, and super resolution [J].
Shen, Huanfeng ;
Zhang, Liangpei ;
Huang, Bo ;
Li, Pingxiang .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2007, 16 (02) :479-490
[33]   Normalized cuts and image segmentation [J].
Shi, JB ;
Malik, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (08) :888-905
[34]   Video Segmentation Based on Motion Coherence of Particles in a Video Sequence [J].
Silva, Luciano S. ;
Scharcanski, Jacob .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2010, 19 (04) :1036-1049
[35]   Crowd Analysis Using Computer Vision Techniques [A survey] [J].
Silveira Jacques, Julio Cezar, Jr. ;
Musse, Soraia Raupp ;
Jung, Claudio Rosito .
IEEE SIGNAL PROCESSING MAGAZINE, 2010, 27 (05) :66-77
[36]   Multibody Structure-and-Motion Segmentation by Branch-and-Bound Model Selection [J].
Thakoor, Ninad ;
Gao, Jean ;
Devarajan, Venkat .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2010, 19 (06) :1393-1402
[37]   A shape-based approach to the segmentation of medical imagery using level sets [J].
Tsai, A ;
Yezzi, A ;
Wells, W ;
Tempany, C ;
Tucker, D ;
Fan, A ;
Grimson, WE ;
Willsky, A .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2003, 22 (02) :137-154
[38]   Joint multiregion segmentation and parametric estimation of image motion by basis function representation and level set evolution [J].
Vázquez, C ;
Mitiche, A ;
Laganière, R .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (05) :782-793
[39]   A multiphase level set framework for image segmentation using the Mumford and Shah model [J].
Vese, LA ;
Chan, TF .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2002, 50 (03) :271-293
[40]   A unified algebraic approach to 2-D and 3-D motion segmentation and estimation [J].
Vidal, Rene ;
Ma, Yi .
JOURNAL OF MATHEMATICAL IMAGING AND VISION, 2006, 25 (03) :403-421