Human Fall-down Event Detection Based on 2D Skeletons and Deep Learning Approach

被引：0

作者：

Lie, Wen-Nung ^{[1
]}

Anh Tu Le ^{[2
]}

Lin, Guan-Han ^{[1
]}

机构：

[1] Natl Chung Cheng Univ, Dept Elect Engn, Chiayi, Taiwan

[2] Ho Chi Minh City Univ Technol, Fac Elect & Elect Engn, Ho Chi Minh City, Vietnam

来源：

2018 INTERNATIONAL WORKSHOP ON ADVANCED IMAGE TECHNOLOGY (IWAIT) | 2018年

关键词：

Fall-down event detection; human action recognition; deep learning; human skeleton;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The goal of this research is to apply the state-of-the-art deep learning approach to human fall down event detection based on 2D skeletons extracted from RGB sequence. In this paper, we adopt convolutional neural network (CNN) to extract humans' 2D skeletons for each input frame and then employ recurrent neural network (RNN) with Long Short Term Memory (LSTM) state cells to process temporal skeleton series to make best use of not only spatial features but also temporal information to classify each short-term action to five categories, i.e., standing, walking, falling, lying, and rising. After simple rule processing, the consecutive RNN outputs can be used to detect human's long-term actions (falling down event) and determine whether to issue an alarm or not. The accuracy of classification into 5 sub-actions is capable of achieving 90%. Our contributions lie on two aspects: (1) improving the performance on short-term human action recognition based on the combination of CNN and RNN/LSTM, (2) excluding the fall-down events that actually need no help and achieving a lower false alarm rate.

引用

页数：4

共 11 条

[1] [Anonymous], P IEEE INT C COMP VI
[2] [Anonymous], 2016, Lecture Notes in Computer Science, DOI [10.1007/978-3-319-46493-0_38, DOI 10.1007/978-3-319-46493-0_38]
[3] A Human Activity Recognition System Using Skeleton Data from RGBD Sensors
Cippitelli, Enea
Gasparrini, Samuele
Gambi, Ennio
Spinsante, Susanna
[J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2016, 2016
[4] Donahue J, 2015, PROC CVPR IEEE, P2625, DOI 10.1109/CVPR.2015.7298878
[5] Du Yong, 2015, 3 IAPR AS C PATT REC
[6] Hai P. T., 2016 IEEE 6 INT C CO
[7] Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.8.1735, 10.1007/978-3-642-24797-2, 10.1162/neco.1997.9.1.1]
[8] DeeperCut: A Deeper, Stronger, and Faster Multi-person Pose Estimation Model
Insafutdinov, Eldar
Pishchulin, Leonid
Andres, Bjoern
Andriluka, Mykhaylo
Schiele, Bernt
[J]. COMPUTER VISION - ECCV 2016, PT VI, 2016, 9910 : 34 - 50
[9] ImageNet Classification with Deep Convolutional Neural Networks
Krizhevsky, Alex
Sutskever, Ilya
Hinton, Geoffrey E.
[J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90
[10] Ramakrishna V, 2014, LECT NOTES COMPUT SC, V8690, P33, DOI 10.1007/978-3-319-10605-2_3

← 1 2 →