Human Fall-down Event Detection Based on 2D Skeletons and Deep Learning Approach

被引:0
作者
Lie, Wen-Nung [1 ]
Anh Tu Le [2 ]
Lin, Guan-Han [1 ]
机构
[1] Natl Chung Cheng Univ, Dept Elect Engn, Chiayi, Taiwan
[2] Ho Chi Minh City Univ Technol, Fac Elect & Elect Engn, Ho Chi Minh City, Vietnam
来源
2018 INTERNATIONAL WORKSHOP ON ADVANCED IMAGE TECHNOLOGY (IWAIT) | 2018年
关键词
Fall-down event detection; human action recognition; deep learning; human skeleton;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The goal of this research is to apply the state-of-the-art deep learning approach to human fall down event detection based on 2D skeletons extracted from RGB sequence. In this paper, we adopt convolutional neural network (CNN) to extract humans' 2D skeletons for each input frame and then employ recurrent neural network (RNN) with Long Short Term Memory (LSTM) state cells to process temporal skeleton series to make best use of not only spatial features but also temporal information to classify each short-term action to five categories, i.e., standing, walking, falling, lying, and rising. After simple rule processing, the consecutive RNN outputs can be used to detect human's long-term actions (falling down event) and determine whether to issue an alarm or not. The accuracy of classification into 5 sub-actions is capable of achieving 90%. Our contributions lie on two aspects: (1) improving the performance on short-term human action recognition based on the combination of CNN and RNN/LSTM, (2) excluding the fall-down events that actually need no help and achieving a lower false alarm rate.
引用
收藏
页数:4
相关论文
共 11 条
  • [1] [Anonymous], P IEEE INT C COMP VI
  • [2] [Anonymous], 2016, Lecture Notes in Computer Science, DOI [10.1007/978-3-319-46493-0_38, DOI 10.1007/978-3-319-46493-0_38]
  • [3] A Human Activity Recognition System Using Skeleton Data from RGBD Sensors
    Cippitelli, Enea
    Gasparrini, Samuele
    Gambi, Ennio
    Spinsante, Susanna
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2016, 2016
  • [4] Donahue J, 2015, PROC CVPR IEEE, P2625, DOI 10.1109/CVPR.2015.7298878
  • [5] Du Yong, 2015, 3 IAPR AS C PATT REC
  • [6] Hai P. T., 2016 IEEE 6 INT C CO
  • [7] Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.8.1735, 10.1007/978-3-642-24797-2, 10.1162/neco.1997.9.1.1]
  • [8] DeeperCut: A Deeper, Stronger, and Faster Multi-person Pose Estimation Model
    Insafutdinov, Eldar
    Pishchulin, Leonid
    Andres, Bjoern
    Andriluka, Mykhaylo
    Schiele, Bernt
    [J]. COMPUTER VISION - ECCV 2016, PT VI, 2016, 9910 : 34 - 50
  • [9] ImageNet Classification with Deep Convolutional Neural Networks
    Krizhevsky, Alex
    Sutskever, Ilya
    Hinton, Geoffrey E.
    [J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90
  • [10] Ramakrishna V, 2014, LECT NOTES COMPUT SC, V8690, P33, DOI 10.1007/978-3-319-10605-2_3