KS-FuseNet: An Efficient Action Recognition Method Based on Keyframe Selection and Feature Fusion

Authors
Mao, Keming [1]
Xiao, Yilong [1]
Jing, Xin [1]
Hu, Zepeng [1]
Ping, Yi [1]
Affiliation
[1] Northeastern Univ, Software Coll, Shenyang, Peoples R China
Source
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VII | 2025 / Vol. 15037
Keywords
Action recognition; Spatial-temporal; Feature fusion; Keyframe selection; CONTEXT;
DOI
10.1007/978-981-97-8511-7_38
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Addressing the challenge of effectively capturing features in contemporary video tasks, we propose an action recognition approach grounded in keyframe filtering and feature fusion. Our method comprises two core modules. The keyframe screening module employs an attention mechanism to segregate the input depth feature map sequence into two distinct tensors, effectively reducing spatial redundancy computation and enhancing key feature capture. The other spatio-temporal and action feature module features two branches with divergent structures, performing spatio-temporal and action feature extraction on the differentiated features from the previous module. Through these closely linked modules, our approach effectively discerns and extracts meaningful video features for subsequent classification tasks. We construct an end-to-end deep learning model using established frameworks, training and validating it on a generic video dataset, and confirm its efficacy through comparison and ablation experiments. Experiments conducted on this dataset demonstrate that our model surpasses the majority of prior works.
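The abstract says the keyframe screening module uses an attention mechanism to split the input feature sequence into two tensors, but it does not give the mechanism itself. The following is only a minimal sketch of one way such a split could work, not the authors' implementation. It assumes each frame is already reduced to a feature vector, scores frames by scaled dot product against a query vector, and keeps the top-k frames as keyframes. The names `split_by_attention`, `query`, and `k` are hypothetical and do not come from the paper.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def split_by_attention(frames, query, k):
    """Split per-frame feature vectors into (keyframes, remaining frames).

    Assumption: frames are scored by scaled dot product with `query`,
    and the k highest-weighted frames are treated as keyframes.
    Temporal order is preserved inside each group.
    """
    scale = math.sqrt(len(query))
    scores = [sum(f_i * q_i for f_i, q_i in zip(f, query)) / scale
              for f in frames]
    weights = softmax(scores)
    ranked = sorted(range(len(frames)), key=lambda i: weights[i], reverse=True)
    key_idx = sorted(ranked[:k])
    rest_idx = sorted(ranked[k:])
    keyframes = [frames[i] for i in key_idx]
    remaining = [frames[i] for i in rest_idx]
    return keyframes, remaining, weights


# Toy input: four frames with 2-dimensional features (illustrative values only).
frames = [[0.1, 0.0], [2.0, 1.0], [0.2, 0.1], [1.5, 2.0]]
query = [1.0, 1.0]
key, rest, weights = split_by_attention(frames, query, k=2)
```

In a trained model the query and the selection rule would be learned rather than fixed, and the two resulting groups would feed the two feature-extraction branches the abstract describes.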
Pages: 540-553
Page count: 14