Spatio-temporal human action localization in indoor surveillances

被引:3
|
作者
Liu, Zihao [1 ]
Yan, Danfeng [1 ]
Cai, Yuanqiang [1 ]
Song, Yan [2 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Comp Sci, Natl Pilot Software Engn Sch, Beijing 100876, Peoples R China
[2] Shanghai Int Studies Univ, Sch Business & Management, Shanghai 200083, Peoples R China
关键词
Video analysis; Spatio-temporal action localization dataset; Real-world indoor surveillance;
D O I
10.1016/j.patcog.2023.110087
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spatio-temporal action localization is a crucial and challenging task in the field of video understanding. Existing benchmarks for spatio-temporal action detection are limited by factors such as incomplete annotations, highlevel non-universal actions, and uncommon scenarios. To address these limitations and facilitate research in real-world security applications, we introduce a novel human-centric dataset for spatio-temporal localization of atomic actions in indoor surveillance settings, termed as HIA (Human-centric Indoor Actions). The HIA dataset is constructed by selecting 30 atomic action classes, compiling 100 surveillance videos, and annotating 219,225 frames with 370,937 bounding boxes. The primary characteristics of HIA include (1) accurate spatiotemporal annotations for atomic actions, (2) human-centric annotations at the frame level, (3) temporal linking of persons across discontinuous tracks, and (4) utilization of indoor surveillance videos. Our HIA, with its realistic settings in indoor surveillance scenes and comprehensive annotations, presents a valuable and novel challenge to the spatio-temporal action localization domain. To establish a benchmark, we evaluate various methods and provide an in-depth analysis of the HIA dataset. The HIA dataset will be made available soon, and we anticipate that it will serve as a standard and practical benchmark for the research community.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Spatio-temporal human action localization in indoor surveillances
    Liu, Zihao
    Yan, Danfeng
    Cai, Yuanqiang
    Song, Yan
    Pattern Recognition, 2024, 147
  • [2] Spatio-Temporal Action Localization for Human Action Recognition in Large Dataset
    Megrhi, Sameh
    Jmal, Marwa
    Beghdadi, Azeddine
    Mseddi, Wided
    VIDEO SURVEILLANCE AND TRANSPORTATION IMAGING APPLICATIONS 2015, 2015, 9407
  • [3] Action Tubelet Detector for Spatio-Temporal Action Localization
    Kalogeiton, Vicky
    Weinzaepfel, Philippe
    Ferrari, Vittorio
    Schmid, Cordelia
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4415 - 4423
  • [4] Learning to track for spatio-temporal action localization
    Weinzaepfel, Philippe
    Harchaoui, Zaid
    Schmid, Cordelia
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 3164 - 3172
  • [5] Spatio-temporal action localization and detection for human recognition in big dataset
    Megrhi, Sameh
    Jmal, Marwa
    Souidene, Wided
    Beghdadi, Azeddine
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2016, 41 : 375 - 390
  • [6] Improved Spatio-temporal Action Localization for Surveillance Videos
    Liang, Morgan
    Li, Xun
    Onie, Sandersan
    Larsen, Mark
    Sowmya, Arcot
    2021 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA 2021), 2021, : 147 - 154
  • [7] Spatio-temporal information for human action recognition
    Yao, Li
    Liu, Yunjian
    Huang, Shihui
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2016,
  • [8] Spatio-temporal information for human action recognition
    Li Yao
    Yunjian Liu
    Shihui Huang
    EURASIP Journal on Image and Video Processing, 2016
  • [9] Com-STAL: Compositional Spatio-Temporal Action Localization
    Wang, Shaomeng
    Yan, Rui
    Huang, Peng
    Dai, Guangzhao
    Song, Yan
    Shu, Xiangbo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (12) : 7645 - 7657
  • [10] Local and Global Context Reasoning for Spatio-Temporal Action Localization
    Ando, Ryuhei
    Babazaki, Yasunori
    Takahashi, Katsuhiko
    ADVANCES IN VISUAL COMPUTING, ISVC 2023, PT I, 2023, 14361 : 147 - 159