Bilayer Sparse Topic Model for Scene Analysis in Imbalanced Surveillance Videos

被引：13

作者：

Wang, Jinqiao ^{[1
]}

Fu, Wei ^{[2
]}

Lu, Hanqing ^{[1
]}

Ma, Songde ^{[1
]}

机构：

[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China

[2] China Elect Technol Grp Corp, Res Inst 54, Shijiazhuang 050081, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2014年 / 23卷 / 12期

基金：

中国国家自然科学基金;

关键词：

Dynamic scene analysis; sparse coding; topic model;

D O I：

10.1109/TIP.2014.2363408

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Dynamic scene analysis has become a popular research area especially in video surveillance. The goal of this paper is to mine semantic motion patterns and detect abnormalities deviating from normal ones occurring in complex dynamic scenarios. To address this problem, we propose a data-driven and scene-independent approach, namely, Bilayer sparse topic model (BiSTM), where a given surveillance video is represented by a word-document hierarchical generative process. In this BiSTM, motion patterns are treated as latent topics sparsely distributed over low-level motion vectors, whereas a video clip can be sparsely reconstructed by a mixture of topics (motion pattern). In addition to capture the characteristic of extreme imbalance between numerous typical normal activities and few rare abnormalities in surveillance video data, a one-class constraint is directly imposed on the distribution of documents as a discriminant priori. By jointly learning topics and one-class document representation within a discriminative framework, the topic (pattern) space is more specific and explicit. An effective alternative iteration algorithm is presented for the model learning. Experimental results and comparisons on various public data sets demonstrate the promise of the proposed approach.

引用

页码：5198 / 5208

页数：11

共 30 条

[1]

[Anonymous], 2009, PROC 17 ACM INT C MU

[2]

[Anonymous], 2011, ACM T INTEL SYST TEC, DOI DOI 10.1145/1961189.1961199

[3]

Bin Zhao, 2011, 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), P3313, DOI 10.1109/CVPR.2011.5995524

[4] Latent Dirichlet allocation [J].

Blei, DM ;

Ng, AY ;

Jordan, MI .

JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022

[5]

Chin-Liang Wang, 2009, 2009 IEEE Wireless Communications and Networking Conference, DOI 10.1109/WCNC.2009.4917575

[6]

Wang C, 2009, PROC CVPR IEEE, P1903, DOI [10.1109/CVPR.2009.5206800, 10.1109/CVPRW.2009.5206800]

[7] Sparse Reconstruction Cost for Abnormal Event Detection [J].

Cong, Yang ;

Yuan, Junsong ;

Liu, Ji .

2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, :1807-+

[8]

Duchi J., 2008, P 25 INT C MACH LEAR, P272, DOI DOI 10.1145/1390156.1390191

[9] A Markov Clustering Topic Model for Mining Behaviour in Video [J].

Hospedales, Timothy ;

Gong, Shaogang ;

Xiang, Tao .

2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, :1165-1172

[10] Identifying Rare and Subtle Behaviors: A Weakly Supervised Joint Topic Model [J].

Hospedales, Timothy M. ;

Li, Jian ;

Gong, Shaogang ;

Xiang, Tao .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (12) :2451-2464

← 1 2 3 →