Recognition of Complex Events: Exploiting Temporal Dynamics between Underlying Concepts

被引:43
作者
Bhattacharya, Subhabrata [1 ]
Kalaych, Mahdi M. [2 ]
Sukthankar, Rahul [3 ]
Shah, Mubarak [2 ]
机构
[1] Columbia Univ, New York, NY 10027 USA
[2] Univ Cent Florida, Orlando, FL 32816 USA
[3] Google Res, Mountain View, CA USA
来源
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2014年
关键词
VIDEO;
D O I
10.1109/CVPR.2014.287
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While approaches based on bags of features excel at low-level action classification, they are ill-suited for recognizing complex events in video, where concept-based temporal representations currently dominate. This paper proposes a novel representation that captures the temporal dynamics of windowed mid-level concept detectors in order to improve complex event recognition. We first express each video as an ordered vector time series, where each time step consists of the vector formed from the concatenated confidences of the pre-trained concept detectors. We hypothesize that the dynamics of time series for different instances from the same event class, as captured by simple linear dynamical system (LDS) models, are likely to be similar even if the instances differ in terms of low-level visual features. We propose a two-part representation composed of fusing: (1) a singular value decomposition of block Hankel matrices (SSID-S) and (2) a harmonic signature (H-S) computed from the corresponding eigen-dynamics matrix. The proposed method offers several benefits over alternate approaches: our approach is straightforward to implement, directly employs existing concept detectors and can be plugged into linear classification frameworks. Results on standard datasets such as NIST's TRECVID Multimedia Event Detection task demonstrate the improved accuracy of the proposed method.
引用
收藏
页码:2243 / 2250
页数:8
相关论文
共 26 条
  • [1] [Anonymous], CVPR
  • [2] [Anonymous], PERSPECTIVES NONLINE
  • [3] [Anonymous], 2011, CVPR
  • [4] [Anonymous], CVPR
  • [5] [Anonymous], 2004, Beyond the Kalman Filter: Particle Filters for Tracking Applications
  • [6] Binlong Li, 2011, 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), P3193, DOI 10.1109/CVPR.2011.5995672
  • [7] Box G.E. P., 1990, TIME SERIES ANAL
  • [8] Cao LL, 2012, LECT NOTES COMPUT SC, V7573, P688, DOI 10.1007/978-3-642-33709-3_49
  • [9] Modeling, clustering, and segmenting video with mixtures of dynamic textures
    Chan, Antoni B.
    Vasconcelos, Nuno
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (05) : 909 - 926
  • [10] Chaudhry R, 2009, PROC CVPR IEEE, P1932, DOI 10.1109/CVPRW.2009.5206821