Large-scale Isolated Gesture Recognition Using Convolutional Neural Networks

被引:0
|
作者
Wang, Pichao [1 ]
Li, Wanqing [1 ]
Liu, Song [1 ]
Gao, Zhimin [1 ]
Tang, Chang [2 ]
Ogunbona, Philip [1 ]
机构
[1] Univ Wollongong, Adv Multimedia Res Lab, Wollongong, NSW, Australia
[2] Wuhan Univ Sci & Technol, Sch Informat Sci & Engn, Wuhan, Hubei, Peoples R China
关键词
gesture recognition; depth map sequences; Convolutional Neural Networks;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes three simple, compact yet effective representations of depth sequences, referred to respectively as Dynamic Depth Images (DDI), Dynamic Depth Normal Images (DDNI) and Dynamic Depth Motion Normal Images (DDMNI). These dynamic images are constructed from a sequence of depth maps using bidirectional rank pooling to effectively capture the spatial-temporal information. Such image-based representations enable us to fine-tune the existing ConvNets models trained on image data for classification of depth sequences, without introducing large parameters to learn. Upon the proposed representations, a convolutional Neural networks (ConvNets) based method is developed for gesture recognition and evaluated on the Large-scale Isolated Gesture Recognition at the ChaLearn Looking at People (LAP) challenge 2016. The method achieved 55.57% classification accuracy and ranked 2(nd) place in this challenge but was very close to the best performance even though we only used depth data.
引用
收藏
页码:7 / 12
页数:6
相关论文
共 50 条
  • [1] Large-scale Continuous Gesture Recognition Using Convolutional Neural Networks
    Wang, Pichao
    Li, Wanqing
    Liu, Song
    Zhang, Yuyao
    Gao, Zhimin
    Ogunbona, Philip
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 13 - 18
  • [2] Large-scale Isolated Gesture Recognition using Pyramidal 3D Convolutional Networks
    Zhu, Guangming
    Zhang, Liang
    Mei, Lin
    Shao, Jie
    Song, Juan
    Shen, Peiyi
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 19 - 24
  • [3] Large-scale Multimodal Gesture Segmentation and Recognition based on Convolutional Neural Networks
    Wang, Huogen
    Wang, Pichao
    Song, Zhanjie
    Li, Wanqing
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 3138 - 3146
  • [4] Large-scale Multimodal Gesture Recognition Using Heterogeneous Networks
    Wang, Huogen
    Wang, Pichao
    Song, Zhanjie
    Li, Wanqing
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 3129 - 3137
  • [5] Two Streams Recurrent Neural Networks for Large-Scale Continuous Gesture Recognition
    Chai, Xiujuan
    Liu, Zhipeng
    Yin, Fang
    Liu, Zhuang
    Chen, Xilin
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 31 - 36
  • [6] Hand Gesture Recognition using Convolutional Neural Networks
    Lan, Shengchang
    He, Zonglong
    Chen, Weichu
    Chen, Lijia
    2018 USNC-URSI RADIO SCIENCE MEETING (JOINT WITH AP-S SYMPOSIUM), 2018, : 147 - 148
  • [7] On the Large-Scale Transferability of Convolutional Neural Networks
    Zheng, Liang
    Zhao, Yali
    Wang, Shengjin
    Wang, Jingdong
    Yang, Yi
    Tian, Qi
    TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING: PAKDD 2018 WORKSHOPS, 2018, 11154 : 27 - 39
  • [8] Hand Gesture Recognition Using Deep Convolutional Neural Networks
    Strezoski, Gjorgji
    Stojanovski, Dario
    Dimitrovski, Ivica
    Madjarov, Gjorgji
    ICT INNOVATIONS 2016: COGNITIVE FUNCTIONS AND NEXT GENERATION ICT SYSTEMS, 2018, 665 : 49 - 58
  • [9] Large-scale parcellation of the ventricular system using convolutional neural networks
    Atlason, Hans E.
    Shao, Muhan
    Robertsson, Vidar
    Sigurdsson, Sigurdur
    Gudnason, Vilmundur
    Prince, Jerry L.
    Ellingsen, Lotta M.
    MEDICAL IMAGING 2019: BIOMEDICAL APPLICATIONS IN MOLECULAR, STRUCTURAL, AND FUNCTIONAL IMAGING, 2019, 10953
  • [10] Large-scale Video Classification with Convolutional Neural Networks
    Karpathy, Andrej
    Toderici, George
    Shetty, Sanketh
    Leung, Thomas
    Sukthankar, Rahul
    Fei-Fei, Li
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 1725 - 1732