A novel recurrent hybrid network for feature fusion in action recognition

被引:15
|
作者
Yu, Sheng [1 ,2 ,3 ]
Cheng, Yun [2 ]
Xie, Li [2 ]
Luo, Zhiming [1 ,3 ]
Huang, Min [1 ,3 ]
Li, Shaozi [1 ,3 ]
机构
[1] Xiamen Univ, Dept Cognit Sci, Xiamen 361005, Fujian, Peoples R China
[2] Hunan Univ Humanities Sci & Technol, Sch Informat, Loudi, Hunan, Peoples R China
[3] Fujian Key Lab Brain Intelligent Syst, Xiamen, Fujian, Peoples R China
关键词
Deep learning; Action recognition; LSTM; CNNs; IDT; REPRESENTATION;
D O I
10.1016/j.jvcir.2017.09.007
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Action recognition in video is one of the most important and challenging tasks in computer vision. How to efficiently combine the spatial-temporal information to represent video plays a crucial role for action recognition. In this paper, a recurrent hybrid network architecture is designed for action recognition by fusing multi-source features: a two-stream CNNs for learning semantic features, a two-stream single-layer LSTM for learning long-term temporal feature, and an Improved Dense Trajectories (IDT) stream for learning short-term temporal motion feature. In order to mitigate the overfitting issue on small-scale dataset, a video data augmentation method is used to increase the amount of training data, as well as a two-step training strategy is adopted to train our recurrent hybrid network. Experiment results on two challenging datasets UCF-101 and HMDB-51 demonstrate that the proposed method can reach the state-of-the-art performance.
引用
收藏
页码:192 / 203
页数:12
相关论文
共 50 条
  • [1] Action Recognition of Temporal Segment Network Based on Feature Fusion
    Li H.
    Ding Y.
    Li C.
    Zhang S.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (01): : 145 - 158
  • [2] Joint attentive adaptive feature fusion network for action recognition
    Xia, Fan
    Jiang, Min
    Kong, Jun
    Zhuang, Danfeng
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (01)
  • [3] Local Feature Fusion Temporal Convolutional Network for Human Action Recognition
    Song Z.
    Zhou Y.
    Jia J.
    Xin S.
    Liu Y.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2020, 32 (03): : 418 - 424
  • [4] A novel feature for action recognition
    Wen, Hao
    Lu, Zhe-Ming
    Cui, Jia-Lin
    Li, Hao-Lai
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (14) : 41441 - 41456
  • [5] A novel feature for action recognition
    Hao Wen
    Zhe-Ming Lu
    Jia-Lin Cui
    Hao-Lai Li
    Multimedia Tools and Applications, 2024, 83 : 41441 - 41456
  • [6] Recurrent Spatiotemporal Feature Learning for Action Recognition
    Chen, Ze
    Lu, Hongtao
    ICRAI 2018: PROCEEDINGS OF 2018 4TH INTERNATIONAL CONFERENCE ON ROBOTICS AND ARTIFICIAL INTELLIGENCE -, 2018, : 12 - 17
  • [7] Video Expression Recognition Method Based on Spatiotemporal Recurrent Neural Network and Feature Fusion
    Zhou, Xuan
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2021, 17 (02): : 337 - 351
  • [8] Research on behavior recognition based on feature fusion of automatic coder and recurrent neural network
    Zheng Bing
    Yun Dawei
    Liang Yana
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (06) : 8927 - 8935
  • [9] TRAJECTORY FEATURE FUSION FOR HUMAN ACTION RECOGNITION
    Megrhi, Sameh
    Beghdadi, Azeddine
    Souidene, Wided
    2014 5TH EUROPEAN WORKSHOP ON VISUAL INFORMATION PROCESSING (EUVIP 2014), 2014,
  • [10] Feature and Decision Level Fusion for Action Recognition
    Abouelenien, Mohamed
    Wan, Yiwen
    Saudagar, Abdullah
    2012 THIRD INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION & NETWORKING TECHNOLOGIES (ICCCNT), 2012,