KS-FuseNet: An Efficient Action Recognition Method Based on Keyframe Selection and Feature Fusion

被引：0

作者：

Mao, Keming ^{[1
]}

Xiao, Yilong ^{[1
]}

Jing, Xin ^{[1
]}

Hu, Zepeng ^{[1
]}

Ping, Yi ^{[1
]}

机构：

[1] Northeastern Univ, Software Coll, Shenyang, Peoples R China

来源：

PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VII | 2025年 / 15037卷

关键词：

Action recognition; Spatial-temporal; Feature fusion; Keyframe selection; CONTEXT;

D O I：

10.1007/978-981-97-8511-7_38

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Addressing the challenge of effectively capturing features in contemporary video tasks, we propose an action recognition approach grounded in keyframe filtering and feature fusion. Our method comprises two core modules. The keyframe screening module employs an attention mechanism to segregate the input depth feature map sequence into two distinct tensors, effectively reducing spatial redundancy computation and enhancing key feature capture. The other spatio-temporal and action feature module features two branches with divergent structures, performing spatio-temporal and action feature extraction on the differentiated features from the previous module. Through these closely linked modules, our approach effectively discerns and extracts meaningful video features for subsequent classification tasks. We construct an end-to-end deep learning model using established frameworks, training and validating it on a generic video dataset, and confirm its efficacy through comparison and ablation experiments. Experiments conducted on this dataset demonstrate that our model surpasses the majority of prior works.

引用

页码：540 / 553

页数：14

共 50 条

[1] EFFICIENT OBJECT FEATURE SELECTION FOR ACTION RECOGNITION
Zhang, Tianyi
Zhang, Yu
Cai, Jianfei
Kot, Alex C.
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 2707 - 2711
[2] An Improved VLAD Coding Method Based on Fusion Feature in Action Recognition
Luo H.-L.
Wang C.-J.
Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (01): : 49 - 58
[3] Keyframe recommendation based on feature intercross and fusion
Yang, Guanci
He, Zonglin
Su, Zhidong
Li, Yang
Hu, Bingqi
COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (04) : 4955 - 4971
[4] Action Recognition of Temporal Segment Network Based on Feature Fusion
Li H.
Ding Y.
Li C.
Zhang S.
Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (01): : 145 - 158
[5] Realistic Human Action Recognition With Multimodal Feature Selection and Fusion
Wu, Qiuxia
Wang, Zhiyong
Deng, Feiqi
Chi, Zheru
Feng, David Dagan
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2013, 43 (04): : 875 - 885
[6] An efficient video transformer network with token discard and keyframe enhancement for action recognition
Zhang, Qian
Yang, Zuosui
Shao, Mingwen
Liang, Hong
JOURNAL OF SUPERCOMPUTING, 2025, 81 (02):
[7] Emitter Recognition Method Based on Feature Fusion
Tian, Di
Zhang, Jing
Hu, Po
Li, Zhongqi
2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4178 - 4183
[8] Discriminative Feature Fusion with Spectral Method for Human Action Recognition
Xiao, Xiang
Liu, Le
Hu, Haifeng
BIOMETRIC RECOGNITION, CCBR 2015, 2015, 9428 : 641 - 648
[9] Hierarchical Bayesian Multiple Kernel Learning Based Feature Fusion for Action Recognition
Sun, Wen
Yuan, Chunfeng
Wang, Pei
Yang, Shuang
Hu, Weiming
Cai, Zhaoquan
MULTIMODAL PATTERN RECOGNITION OF SOCIAL SIGNALS IN HUMAN-COMPUTER-INTERACTION, MPRSS 2016, 2017, 10183 : 85 - 97
[10] Action recognition method based on lightweight network and rough-fine keyframe extraction
Pan, Hao
Tian, Qiuhong
Li, Saiwei
Miao, Weilun
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 97

← 1 2 3 4 5 →