Video Parsing for Abnormality Detection

被引:0
|
作者
Antic, Borislav [1 ]
Ommer, Bjoern [1 ]
机构
[1] Heidelberg Univ, Interdisciplinary Ctr Sci Comp, D-6900 Heidelberg, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting abnormalities in video is a challenging problem since the class of all irregular objects and behaviors is infinite and thus no (or by far not enough) abnormal training samples are available. Consequently, a standard setting is to find abnormalities without actually knowing what they are because we have not been shown abnormal examples during training. However, although the training data does not define what an abnormality looks like, the main paradigm in this field is to directly search for individual abnormal local patches or image regions independent of another. To address this problem we parse video frames by establishing a set of hypotheses that jointly explain all the foreground while, at same time, trying to find normal training samples that explain the hypotheses. Consequently, we can avoid a direct detection of abnormalities. They are discovered indirectly as those hypotheses which are needed for covering the foreground without finding an explanation by normal samples for themselves. We present a probabilistic model that localizes abnormalities using statistical inference. On the challenging dataset of [15] it outperforms the state-of-the-art by 7% to achieve a frame-based abnormality classification performance of 91% and the localization performance improves by 32% to 76%.
引用
收藏
页码:2415 / 2422
页数:8
相关论文
共 50 条
  • [41] Surveillance Video Parsing with Single Frame Supervision
    Liu, Si
    Wang, Changhu
    Qian, Ruihe
    Yu, Han
    Bao, Renda
    Sun, Yao
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1013 - 1021
  • [42] Soft video parsing by label distribution learning
    Miaogen Ling
    Xin Geng
    Frontiers of Computer Science, 2019, 13 : 302 - 317
  • [44] Active Assembly Guidance with Online Video Parsing
    Wang, Bin
    Wang, Guofeng
    Sharf, Andrei
    Li, Yangyan
    Zhong, Fan
    Qin, Xueying
    CohenOr, Daniel
    Chen, Baoquan
    25TH 2018 IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES (VR), 2018, : 459 - 466
  • [45] Video parsing via spatiotemporally analysis with images
    Li, Xuelong
    Mou, Lichao
    Lu, Xiaoqiang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (19) : 11961 - 11976
  • [46] Event Detection as Graph Parsing
    Xie, Jianye
    Sun, Haotong
    Zhou, Junsheng
    Qu, Weiguang
    Dai, Xinyu
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 1630 - 1640
  • [47] Learning and parsing video events with goal and intent prediction
    Pei, Mingtao
    Si, Zhangzhang
    Yao, Benjamin Z.
    Zhu, Song-Chun
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (10) : 1369 - 1383
  • [48] Parallel parsing of MPEG video in heterogeneous distributed environment
    Nam, Y
    Hwang, E
    HIGH-SPEED NETWORKS AND MULTIMEDIA COMMUNICATIONS, PROCEEDINGS, 2003, 2720 : 264 - 274
  • [49] Motion based parsing for video from Observational Psychology
    Kokaram, A
    Doyle, E
    Lennon, D
    Joyeux, L
    Fuller, R
    MULTIMEDIA CONTENT ANALYSIS, MANAGEMENT, AND RETRIEVAL 2006, 2006, 6073
  • [50] Audiovisual integration with Segment Models for tennis video parsing
    Delakis, Manolis
    Gravier, Guillaume
    Gros, Patrick
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2008, 111 (02) : 142 - 154