Multimodal Dynamic Networks for Gesture Recognition

Cited by: 10
Authors
Wu, Di [1]
Shao, Ling [1]
Affiliations
[1] Univ Sheffield, Dept Elect & Elect Engn, Sheffield S1 3JD, S Yorkshire, England
Source
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14) | 2014
Keywords
Gesture Recognition; Human-Computer Interaction; Multimodal Fusion; Deep Belief Networks
DOI
10.1145/2647868.2654969
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Multimodal input arises naturally in real-world gesture recognition applications such as sign language recognition. In this paper, we propose a novel bi-modal (audio and skeleton joints) dynamic network for gesture recognition. First, state-of-the-art dynamic Deep Belief Networks are deployed to extract high-level audio and skeletal joint representations. Then, instead of traditional late fusion, we add a further perceptron layer for cross-modality learning, which takes as input the penultimate layer of each individual net. Finally, to account for temporal dynamics, the learned shared representations are used to estimate the emission probabilities from which action sequences are inferred. In particular, we demonstrate that multimodal feature learning extracts semantically meaningful shared representations that outperform the individual modalities, and we show the efficacy of the early fusion scheme compared with the traditional method of late fusion.
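The pipeline described in the abstract (per-modality features, a shared fusion layer, then temporal decoding from emission probabilities) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the dynamic Deep Belief Networks are not reproduced, a single softmax layer stands in for the fusion perceptron, the network outputs are used directly as emission scores, and the decoder is a standard Viterbi pass. The names fuse_and_score, viterbi, W and b, and all array shapes, are hypothetical and chosen only for illustration.

```python
import numpy as np


def fuse_and_score(audio_feat, skel_feat, W, b):
    """Fuse per-frame features from two modalities and score hidden states.

    audio_feat: (T, Da) penultimate-layer features of a hypothetical audio net
    skel_feat:  (T, Ds) penultimate-layer features of a hypothetical skeleton net
    W, b:       weights (Da + Ds, S) and bias (S,) of one assumed fusion layer
    Returns a (T, S) array of per-frame state probabilities (rows sum to 1).
    """
    joint = np.concatenate([audio_feat, skel_feat], axis=1)  # (T, Da + Ds)
    logits = joint @ W + b                                   # (T, S)
    logits = logits - logits.max(axis=1, keepdims=True)      # numerical stability
    probs = np.exp(logits)
    return probs / probs.sum(axis=1, keepdims=True)


def viterbi(log_emit, log_trans, log_prior):
    """Return the most likely state sequence given log emission scores.

    log_emit:  (T, S) per-frame log emission scores
    log_trans: (S, S) log transition matrix, entry [i, j] is from state i to j
    log_prior: (S,) log initial-state distribution
    """
    T, S = log_emit.shape
    score = log_prior + log_emit[0]
    back = np.zeros((T, S), dtype=int)
    for t in range(1, T):
        cand = score[:, None] + log_trans      # cand[i, j]: best path ending i -> j
        back[t] = cand.argmax(axis=0)          # best predecessor for each state j
        score = cand.max(axis=0) + log_emit[t]
    path = [int(score.argmax())]
    for t in range(T - 1, 0, -1):              # follow back-pointers to frame 0
        path.append(int(back[t, path[-1]]))
    return path[::-1]
```

In this sketch the fusion layer sees both modalities before any class decision is made, which is the contrast with late fusion, where each modality would be scored separately and only the scores combined.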
Pages: 945-948
Page count: 4