Variational Layered Dynamic Textures

被引:0
作者
Chan, Antoni B. [1 ]
Vasconcelos, Nuno [1 ]
机构
[1] Univ Calif San Diego, Dept Elect & Comp Engn, San Diego, CA 92103 USA
来源
CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4 | 2009年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The layered dynamic texture (LDT) is a generative model, which represents video as a collection of stochastic layers of different appearance and dynamics. Each layer is modeled as a temporal texture sampled from a different linear dynamical system, with regions of the video assigned to a layer using a Markov random field. Model parameters are learned from training video using the EM algorithm. However, exact inference for the E-step is intractable. In this paper, we propose a variational approximation for the LDT that enables efficient learning of the model. We also propose a temporally-switching LDT (TS-LDT), which allows the layer shape to change over time, along with the associated EM algorithm and variational approximation. The ability of the LDT to segment video into layers of coherent appearance and dynamics is also extensively evaluated, on both synthetic and natural video. These experiments show that the model possesses an ability to group regions of globally homogeneous, but locally heterogeneous, stochastic dynamics currently unparalleled in the literature.
引用
收藏
页码:1062 / 1069
页数:8
相关论文
共 24 条
[1]   PERFORMANCE OF OPTICAL-FLOW TECHNIQUES [J].
BARRON, JL ;
FLEET, DJ ;
BEAUCHEMIN, SS .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 1994, 12 (01) :43-77
[2]  
BESAG J, 1974, J ROY STAT SOC B MET, V36, P192
[3]  
BISHOP C. M, 2006, Pattern Recognition and Machine Learning. Information Science and Statistics, DOI [10.1007/978-0-387-45528-0, DOI 10.1007/978-0-387-45528-0]
[4]  
Chan A., 2006, Advances in Neural Information Processing Systems, V18, P203
[5]  
CHAN AB, 2009, SVCLTR200901
[6]   Modeling, clustering, and segmenting video with mixtures of dynamic textures [J].
Chan, Antoni B. ;
Vasconcelos, Nuno .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (05) :909-926
[7]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[8]  
Doretto G, 2003, NINTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS I AND II, PROCEEDINGS, P1236
[9]   Dynamic textures [J].
Doretto, G ;
Chiuso, A ;
Wu, YN ;
Soatto, S .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2003, 51 (02) :91-109
[10]  
FREY BJ, 1999, C COMP VIS PAT REC, P416