Stacked Spatio-Temporal Graph Convolutional Networks for Action Segmentation

Cited by: 0
Authors
Ghosh, Pallabi [1]
Yao, Yi [1]
Davis, Larry S. [1]
Divakaran, Ajay [2]
Affiliations
[1] Univ Maryland, College Pk, MD 20742 USA
[2] SRI Int, 333 Ravenswood Ave, Menlo Pk, CA 94025 USA
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
We propose novel Stacked Spatio-Temporal Graph Convolutional Networks (Stacked-STGCN) for action segmentation, i.e., predicting and localizing a sequence of actions over long videos. We extend the Spatio-Temporal Graph Convolutional Network (STGCN) originally proposed for skeleton-based action recognition to enable nodes with different characteristics (e.g., scene, actor, object, action), feature descriptors with varied lengths, and arbitrary temporal edge connections to account for large graph deformation commonly associated with complex activities. We further introduce the stacked hourglass architecture to STGCN to leverage the advantages of an encoder-decoder design for improved generalization performance and localization accuracy. We explore various descriptors such as frame-level VGG, segment-level I3D, RCNN-based object, etc. as node descriptors to enable action segmentation based on joint inference over comprehensive contextual information. We show results on CAD120 (which provides pre-computed node features and edge weights for fair performance comparison across algorithms) as well as a more complex real-world activity dataset, Charades. Our Stacked-STGCN in general achieves improved performance over the state-of-the-art for both CAD120 and Charades. Moreover, due to its generic design, Stacked-STGCN can be applied to a wider range of applications that require structured inference over long sequences with heterogeneous data types and varied temporal extent.
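The abstract mentions nodes of different types whose descriptors have different lengths, joined by spatial and temporal edges. The sketch below is only an illustration of that general idea and is not the authors' implementation: it projects each node type to a common width and applies one plain mean-aggregation graph convolution. The node types, feature sizes, edge layout, `COMMON_DIM`, `OUT_DIM`, and all helper names are assumptions made for this example, and the stacked hourglass (encoder-decoder) part is not modeled. Plain Python lists are used to keep the sketch dependency-free.

```python
import random

rng = random.Random(0)  # fixed seed so the toy weights are reproducible


def random_matrix(rows, cols):
    """Small random weight matrix (stand-in for learned parameters)."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def matvec(W, x):
    """Multiply matrix W (rows x len(x)) by vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def normalize_adjacency(A):
    """Add self-loops, then row-normalize so each node averages over
    itself and its neighbours."""
    n = len(A)
    A_hat = [[A[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
             for i in range(n)]
    return [[a / sum(row) for a in row] for row in A_hat]


def graph_conv(A_norm, H, W):
    """One layer: aggregate neighbour features, apply a shared linear
    map, then ReLU."""
    n, d = len(H), len(H[0])
    agg = [[sum(A_norm[i][j] * H[j][k] for j in range(n)) for k in range(d)]
           for i in range(n)]
    return [[max(0.0, v) for v in matvec(W, row)] for row in agg]


# Toy graph: 2 time steps x 2 node types, with hypothetical descriptors
# of different lengths (4 for "scene", 3 for "object").
nodes = [
    ("scene", 0, [0.2, 0.1, 0.4, 0.3]),
    ("object", 0, [0.5, 0.9, 0.1]),
    ("scene", 1, [0.3, 0.2, 0.4, 0.1]),
    ("object", 1, [0.6, 0.8, 0.2]),
]

COMMON_DIM = 5  # assumed shared width after per-type projection
OUT_DIM = 3     # assumed output width of the graph convolution

# One projection per node type maps varied-length descriptors to COMMON_DIM.
projections = {
    "scene": random_matrix(COMMON_DIM, 4),
    "object": random_matrix(COMMON_DIM, 3),
}
H = [matvec(projections[kind], feat) for kind, _, feat in nodes]

# Spatial edges link nodes in the same time step (0-1, 2-3);
# temporal edges link the same node type across time steps (0-2, 1-3).
A = [
    [0, 1, 1, 0],
    [1, 0, 0, 1],
    [1, 0, 0, 1],
    [0, 1, 1, 0],
]

A_norm = normalize_adjacency(A)
W = random_matrix(OUT_DIM, COMMON_DIM)
out = graph_conv(A_norm, H, W)  # one OUT_DIM-long feature vector per node
```

Arbitrary temporal connections, as described in the abstract, would correspond here to adding further nonzero entries to `A` between nodes at non-adjacent time steps.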
Pages: 565-574
Page count: 10