A novel recurrent hybrid network for feature fusion in action recognition

被引：15

作者：

Yu, Sheng ^{[1
,2
,3
]}

Cheng, Yun ^{[2
]}

Xie, Li ^{[2
]}

Luo, Zhiming ^{[1
,3
]}

Huang, Min ^{[1
,3
]}

Li, Shaozi ^{[1
,3
]}

机构：

[1] Xiamen Univ, Dept Cognit Sci, Xiamen 361005, Fujian, Peoples R China

[2] Hunan Univ Humanities Sci & Technol, Sch Informat, Loudi, Hunan, Peoples R China

[3] Fujian Key Lab Brain Intelligent Syst, Xiamen, Fujian, Peoples R China

来源：

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION | 2017年 / 49卷

关键词：

Deep learning; Action recognition; LSTM; CNNs; IDT; REPRESENTATION;

D O I：

10.1016/j.jvcir.2017.09.007

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Action recognition in video is one of the most important and challenging tasks in computer vision. How to efficiently combine the spatial-temporal information to represent video plays a crucial role for action recognition. In this paper, a recurrent hybrid network architecture is designed for action recognition by fusing multi-source features: a two-stream CNNs for learning semantic features, a two-stream single-layer LSTM for learning long-term temporal feature, and an Improved Dense Trajectories (IDT) stream for learning short-term temporal motion feature. In order to mitigate the overfitting issue on small-scale dataset, a video data augmentation method is used to increase the amount of training data, as well as a two-step training strategy is adopted to train our recurrent hybrid network. Experiment results on two challenging datasets UCF-101 and HMDB-51 demonstrate that the proposed method can reach the state-of-the-art performance.

引用

页码：192 / 203

页数：12

共 50 条

[1] Action Recognition of Temporal Segment Network Based on Feature Fusion
Li H.
Ding Y.
Li C.
Zhang S.
Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (01): : 145 - 158
[2] Joint attentive adaptive feature fusion network for action recognition
Xia, Fan
Jiang, Min
Kong, Jun
Zhuang, Danfeng
JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (01)
[3] Local Feature Fusion Temporal Convolutional Network for Human Action Recognition
Song Z.
Zhou Y.
Jia J.
Xin S.
Liu Y.
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2020, 32 (03): : 418 - 424
[4] A novel feature for action recognition
Wen, Hao
Lu, Zhe-Ming
Cui, Jia-Lin
Li, Hao-Lai
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (14) : 41441 - 41456
[5] A novel feature for action recognition
Hao Wen
Zhe-Ming Lu
Jia-Lin Cui
Hao-Lai Li
Multimedia Tools and Applications, 2024, 83 : 41441 - 41456
[6] Recurrent Spatiotemporal Feature Learning for Action Recognition
Chen, Ze
Lu, Hongtao
ICRAI 2018: PROCEEDINGS OF 2018 4TH INTERNATIONAL CONFERENCE ON ROBOTICS AND ARTIFICIAL INTELLIGENCE -, 2018, : 12 - 17
[7] Video Expression Recognition Method Based on Spatiotemporal Recurrent Neural Network and Feature Fusion
Zhou, Xuan
JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2021, 17 (02): : 337 - 351
[8] Research on behavior recognition based on feature fusion of automatic coder and recurrent neural network
Zheng Bing
Yun Dawei
Liang Yana
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (06) : 8927 - 8935
[9] TRAJECTORY FEATURE FUSION FOR HUMAN ACTION RECOGNITION
Megrhi, Sameh
Beghdadi, Azeddine
Souidene, Wided
2014 5TH EUROPEAN WORKSHOP ON VISUAL INFORMATION PROCESSING (EUVIP 2014), 2014,
[10] Feature and Decision Level Fusion for Action Recognition
Abouelenien, Mohamed
Wan, Yiwen
Saudagar, Abdullah
2012 THIRD INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION & NETWORKING TECHNOLOGIES (ICCCNT), 2012,

← 1 2 3 4 5 →