Lightweight Multiscale Spatiotemporal Locally Connected Graph Convolutional Networks for Single Human Motion Forecasting

被引:3
作者
Zhai, Di-Hua [1 ,2 ]
Yan, Zigeng [1 ]
Xia, Yuanqing [1 ]
机构
[1] Beijing Inst Technol, Sch Automat, Beijing, Peoples R China
[2] Yangtze Delta RegionAcademy Beijing Inst Technol, Jiaxing 314001, Peoples R China
基金
中国国家自然科学基金;
关键词
Human motion forecasting; locally connected; lightweight; multiscale; spatiotemporal; GCN;
D O I
10.1109/TASE.2023.3301657
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human motion forecasting is an important and challenging task in many computer vision application domains. Recent work concentrates on utilizing the timing processing ability of recurrent neural networks (RNNs) to achieve smooth and reliable results in short-term prediction. However, as evidenced by previous works, RNNs suffer from error accumulation, leading to unreliable results. In this paper, we propose a simple feedforward deep neural network for motion prediction, which takes into account temporal smoothness between frames and spatial dependencies between human body joints. We design Lightweight Multiscale Spatiotemporal Locally Connected Graph Convolutional Networks (MST-LCGCN) for Single Human Motion Forecasting to implicitly establish the spatiotemporal dependence in the process of human movement, where different scales fuse dynamically during training. The entire model is action-agnostic and follows a framework of encoder-decoder. The encoder consists of temporal GCNs (TGCNs) to capture motion features between frames and locally connected spatial GCNs (SGCNs) to extract spatial structure among joints. The decoder uses temporal convolution networks (TCNs) to maintain its extensibility for long-term prediction. Considerable experiments show that our approach outperforms previous methods on the Human3.6M and CMU Mocap datasets while only requiring much fewer parameters.
引用
收藏
页码:4768 / 4777
页数:10
相关论文
共 30 条
[1]   A Spatio-temporal Transformer for 3D Human Motion Prediction [J].
Aksan, Emre ;
Kaufmann, Manuel ;
Cao, Peng ;
Hilliges, Otmar .
2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, :565-574
[2]   Structured Prediction Helps 3D Human Motion Modelling [J].
Aksan, Emre ;
Kaufmann, Manuel ;
Hilliges, Otmar .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :7143-7152
[3]   Learning Progressive Joint Propagation for Human Motion Prediction [J].
Cai, Yujun ;
Huang, Lin ;
Wang, Yiwei ;
Cham, Tat-Jen ;
Cai, Jianfei ;
Yuan, Junsong ;
Liu, Jun ;
Yang, Xu ;
Zhu, Yiheng ;
Shen, Xiaohui ;
Liu, Ding ;
Liu, Jing ;
Thalmann, Nadia Magnenat .
COMPUTER VISION - ECCV 2020, PT VII, 2020, 12352 :226-242
[4]  
Chelba Ciprian, 2013, arXiv
[5]   Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition [J].
Chen, Tailin ;
Zhou, Desen ;
Wang, Jian ;
Wang, Shidong ;
Guan, Yu ;
He, Xuming ;
Ding, Errui .
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, :4334-4342
[6]   Uni-and-Bi-Directional Video Prediction via Learning Object-Centric Transformation [J].
Chen, Xiongtao ;
Wang, Wenmin .
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (06) :1591-1604
[7]  
Chung Junyoung, 2014, CORR, DOI DOI 10.48550/ARXIV.1412.3555
[8]   Context-aware Human Motion Prediction [J].
Corona, Enric ;
Pumarola, Albert ;
Alenya, Guillem ;
Moreno-Noguer, Francesc .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :6990-6999
[9]   Learning Dynamic Relationships for 3D Human Motion Prediction [J].
Cui, Qiongjie ;
Sun, Huaijiang ;
Yang, Fei .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :6518-6526
[10]   MSR-GCN: Multi-Scale Residual Graph Convolution Networks for Human Motion Prediction [J].
Dang, Lingwei ;
Nie, Yongwei ;
Long, Chengjiang ;
Zhang, Qing ;
Li, Guiqing .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :11447-11456