Robust human activity recognition from depth video using spatiotemporal multi-fused features

Cited by: 249
Authors
Jalal, Ahmad [1]
Kim, Yeon-Ho [1]
Kim, Yong-Joong [1]
Kamal, Shaharyar [2]
Kim, Daijin [1]
Affiliations
[1] POSTECH, Dept Comp Sci & Engn, San 31, Pohang 790784, South Korea
[2] Kyung Hee Univ, Dept Elect & Radio Engn, Yongin 446701, South Korea
Keywords
Human activity recognition; Depth silhouette; Skeleton joint extraction; Spatiotemporal multi-fused feature extraction; Hidden Markov model; Forward spotting scheme; ALGORITHM; SYSTEM; POSE
DOI
10.1016/j.patcog.2016.08.003
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recently developed depth imaging technologies have opened new directions for human activity recognition (HAR) without attaching optical markers or any other motion sensors to human body parts. In this paper, we propose novel multi-fused features for an online HAR system that recognizes human activities from continuous sequences of depth maps. The proposed online HAR system segments human depth silhouettes using temporal human motion information and obtains human skeleton joints using spatiotemporal human body information. It then extracts spatiotemporal multi-fused features that concatenate four skeleton joint features and one body shape feature. The skeleton joint features are the torso-based distance feature (DT), the key joint-based distance feature (DK), the spatiotemporal magnitude feature (M), and the spatiotemporal directional angle feature (θ). The body shape feature, called HOG-DDS, represents the projections of the depth differential silhouettes (DDS) between two consecutive frames onto three orthogonal planes in histogram of oriented gradients (HOG) format. The size of the proposed spatiotemporal multi-fused feature is reduced by representing it as a code vector from a codebook generated by a vector quantization method. The system then trains the hidden Markov model (HMM) with the code vectors of the multi-fused features and recognizes the segmented human activity by the forward spotting scheme using the trained HMM-based human activity classifiers. Experimental results on three challenging depth video datasets, IM-Daily-DepthActivity, MSRAction3D, and MSRDailyActivity3D, demonstrate that the proposed online HAR method using the proposed multi-fused features outperforms state-of-the-art HAR methods in terms of recognition accuracy. © 2016 Elsevier Ltd. All rights reserved.
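The last two stages described in the abstract (vector quantization of the fused feature, then HMM-based classification) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `quantize`, `forward_log_likelihood`, and `classify` are hypothetical, the codebook is assumed to be given rather than learned, and a plain scaled forward algorithm over a whole pre-segmented sequence stands in for the paper's forward spotting scheme.

```python
import numpy as np


def quantize(features, codebook):
    """Map each feature vector (one row per frame) to the index of its
    nearest code vector, using squared Euclidean distance."""
    diffs = features[:, None, :] - codebook[None, :, :]
    dists = (diffs ** 2).sum(axis=2)
    return dists.argmin(axis=1)


def forward_log_likelihood(symbols, pi, A, B):
    """Scaled forward algorithm for a discrete HMM.

    pi: initial state probabilities, shape (n_states,)
    A:  state transition matrix, shape (n_states, n_states)
    B:  emission matrix, shape (n_states, n_symbols)
    Returns log P(symbols | pi, A, B).
    """
    alpha = pi * B[:, symbols[0]]
    log_lik = 0.0
    for t in range(len(symbols)):
        if t > 0:
            # Propagate through the transitions, then weight by the emission.
            alpha = (alpha @ A) * B[:, symbols[t]]
        scale = alpha.sum()
        log_lik += np.log(scale)
        alpha = alpha / scale  # rescale to avoid numerical underflow
    return log_lik


def classify(symbols, models):
    """Return the name of the activity model with the highest log-likelihood.

    models: dict mapping an activity name to its (pi, A, B) tuple.
    """
    scores = {name: forward_log_likelihood(symbols, *params)
              for name, params in models.items()}
    return max(scores, key=scores.get)
```

In this sketch, a sequence of per-frame fused feature vectors would first be passed through `quantize` to obtain a symbol sequence, which `classify` then scores against one HMM per activity.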
Pages: 295-308
Number of pages: 14
Related papers
50 in total
  • [1] Video Recognition of Human Fall Based on Spatiotemporal Features
    Wang, Kai
    Zhao, Youjin
    Xiong, Qingyu
    Shen, Xiling
    Fan, Min
    Gao, Min
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2016, 22 (02): 303-309
  • [2] A Spatiotemporal Robust Approach for Human Activity Recognition
    Uddin, Md. Zia
    Kim, Tae-Seong
    Kim, Jeong-Tai
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2013, 10
  • [3] Depth Images-based Human Detection, Tracking and Activity Recognition Using Spatiotemporal Features and Modified HMM
    Kamal, Shaharyar
    Jalal, Ahmad
    Kim, Daijin
    JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2016, 11 (06): 1857-1862
  • [4] A Depth Video-based Human Detection and Activity Recognition using Multi-features and Embedded Hidden Markov Models for Health Care Monitoring Systems
    Jalal, Ahmad
    Kamal, Shaharyar
    Kim, Daijin
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2017, 4 (04): 54-62
  • [5] Depth-based human activity recognition via multi-level fused features and fast broad learning system
    Yao, Huang
    Yang, Mengting
    Chen, Tiantian
    Wei, Yantao
    Zhang, Yu
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2020, 16 (02)
  • [6] A Robust Gait Recognition System Using Spatiotemporal Features and Deep Learning
    Uddin, Md Zia
    Khaksar, Weria
    Torresen, Jim
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTISENSOR FUSION AND INTEGRATION FOR INTELLIGENT SYSTEMS (MFI), 2017: 156-161
  • [7] A Robust Human Activity Recognition Approach Using OpenPose, Motion Features, and Deep Recurrent Neural Network
    Noori, Farzan Majeed
    Wallace, Benedikte
    Uddin, Md Zia
    Torresen, Jim
    IMAGE ANALYSIS, 2019, 11482: 299-310
  • [8] Depth Video-based Human Activity Recognition System Using Translation and Scaling Invariant Features for Life Logging at Smart Home
    Jalal, A.
    Uddin, Md Zia
    Kim, T. -S.
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2012, 58 (03): 863-871
  • [9] On integration of multiple features for human activity recognition in video sequences
    Kushwaha, Arati
    Khare, Ashish
    Srivastava, Prashant
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (21-23): 32511-32538