Temporal Sequence Modeling for Video Event Detection

被引:44
作者
Cheng, Yu [1 ]
Fan, Quanfu [1 ]
Pankanti, Sharath [1 ]
Choudhary, Alok [2 ]
机构
[1] IBM TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
[2] Northwestern Univ, Dept EECS, Evanston, IL 60208 USA
来源
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2014年
关键词
RECOGNITION;
D O I
10.1109/CVPR.2014.286
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel approach for event detection in video by temporal sequence modeling. Exploiting temporal information has lain at the core of many approaches for video analysis (i.e. action, activity and event recognition). Unlike previous works doing temporal modeling at semantic event level, we propose to model temporal dependencies in the data at sub-event level without using event annotations. This frees our model from ground truth and addresses several limitations in previous work on temporal modeling. Based on this idea, we represent a video by a sequence of visual words learnt from the video, and apply the Sequence Memoizer [21] to capture long-range dependencies in a temporal context in the visual sequence. This data-driven temporal model is further integrated with event classification for jointly performing segmentation and classification of events in a video. We demonstrate the efficacy of our approach on two challenging datasets for visual recognition.
引用
收藏
页码:2235 / 2242
页数:8
相关论文
共 24 条
[1]  
[Anonymous], 2007, P IEEE C COMP VIS PA
[2]  
[Anonymous], 2011, TRECVID 2010 OVERVIE
[3]  
[Anonymous], 2009, Proceedings of the 26th Annual International Conference on Machine Learning
[4]  
[Anonymous], P21 ACMINT C MULT MM
[5]  
[Anonymous], P TRECVID 2012
[6]  
Brown P. F., 1992, Computational Linguistics, V18, P467
[7]  
Chen M.-y., 2009, MOSIFT RECOGNIZING H
[8]   On the algorithmic implementation of multiclass kernel-based vector machines [J].
Crammer, K ;
Singer, Y .
JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (02) :265-292
[9]  
Fan QF, 2009, PROC CVPR IEEE, P943, DOI 10.1109/CVPRW.2009.5206644
[10]   Large-scale event detection using semi-hidden Markov models [J].
Hongeng, S ;
Nevatia, R .
NINTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS I AND II, PROCEEDINGS, 2003, :1455-1462