OFPI: Optical Flow Pose Image for Action Recognition

被引:2
|
作者
Chen, Dong [1 ,2 ]
Zhang, Tao [2 ]
Zhou, Peng [1 ]
Yan, Chenyang [3 ]
Li, Chuanqi [1 ,2 ]
机构
[1] Guangxi Normal Univ, Coll Comp Sci & Engn, Guilin 541004, Peoples R China
[2] Nanning Normal Univ, Coll Phys & Elect Engn, Nanning 530001, Peoples R China
[3] Kanazawa Univ, Div Elect Engn & Comp Sci, Kanazawa 9201192, Japan
关键词
action recognition; optical flow pose image; skeletal data; transformer;
D O I
10.3390/math11061451
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Most approaches to action recognition based on pseudo-images involve encoding skeletal data into RGB-like image representations. This approach cannot fully exploit the kinematic features and structural information of human poses, and convolutional neural network (CNN) models that process pseudo-images lack a global field of view and cannot completely extract action features from pseudo-images. In this paper, we propose a novel pose-based action representation method called Optical Flow Pose Image (OFPI) in order to fully capitalize on the spatial and temporal information of skeletal data. Specifically, in the proposed method, an advanced pose estimator collects skeletal data before locating the target person and then extracts skeletal data utilizing a human tracking algorithm. The OFPI representation is obtained by aggregating these skeletal data over time. To test the superiority of OFPI and investigate the significance of the model having a global field of view, we trained a simple CNN model and a transformer-based model, respectively. Both models achieved superior outcomes. Because of the global field of view, especially in the transformer-based model, the OFPI-based representation achieved 98.3% and 94.2% accuracy on the KTH and JHMDB datasets, respectively. Compared with other advanced pose representation methods and multi-stream methods, OFPI achieved state-of-the-art performance on the JHMDB dataset, indicating the utility and potential of this algorithm for skeleton-based action recognition research.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Action Recognition from Pose Signature in Static Image
    Qian, Yinzhong
    Chen, Wenbin
    Shen, I-Fan
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2016, 30 (03)
  • [2] Human Body Pose Distance Image Analysis for Action Recognition
    Verma, Amit
    Meenpal, Toshanlal
    Acharya, Bibhudendra
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (07)
  • [3] Optical flow-motion history image (OF-MHI) for action recognition
    Tsai, Du-Ming
    Chiu, Wei-Yao
    Lee, Men-Han
    SIGNAL IMAGE AND VIDEO PROCESSING, 2015, 9 (08) : 1897 - 1906
  • [4] Optical flow-motion history image (OF-MHI) for action recognition
    Du-Ming Tsai
    Wei-Yao Chiu
    Men-Han Lee
    Signal, Image and Video Processing, 2015, 9 : 1897 - 1906
  • [5] Action recognition from mutually incoherent pose bases in static image
    Qian, Yinzhong
    Chen, Wenbin
    Shen, I-fan
    IET COMPUTER VISION, 2018, 12 (03) : 233 - 240
  • [6] Double-Stream Convolutional Networks with Sequential Optical Flow Image for Action Recognition
    Li Qinghui
    Li Aihua
    Wang Tao
    Cui Zhigao
    ACTA OPTICA SINICA, 2018, 38 (06)
  • [7] On the Combination of IMU and Optical Flow for Action Recognition
    Alhersh, Taha
    Stuckenschmidt, Heiner
    2019 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS (PERCOM WORKSHOPS), 2019, : 17 - 21
  • [8] A 99.4 fps Optical Flow Estimation Processor with Image Tiling for Action Recognition in Mobile Devices
    Lee, Juhyoung
    Choi, Sungpill
    Lee, Jinmook
    Kang, Sanghoon
    Yoo, Hoi-Jun
    JOURNAL OF SEMICONDUCTOR TECHNOLOGY AND SCIENCE, 2019, 19 (01) : 116 - 123
  • [9] Enriching Optical Flow with Appearance Information for Action Recognition
    Pan, Yijun
    Sun, Xiaoyan
    Wu, Feng
    2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 251 - 254
  • [10] Using Phase Instead of Optical Flow for Action Recognition
    Hommos, Omar
    Pintea, Silvia L.
    Mettes, Pascal S. M.
    van Gemert, Jan C.
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT VI, 2019, 11134 : 678 - 691