KS-FuseNet: An Efficient Action Recognition Method Based on Keyframe Selection and Feature Fusion

Authors
Mao, Keming [1]
Xiao, Yilong [1]
Jing, Xin [1]
Hu, Zepeng [1]
Ping, Yi [1]
Affiliations
[1] Northeastern Univ, Software Coll, Shenyang, Peoples R China
Source
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VII | 2025 / Vol. 15037
Keywords
Action recognition; Spatial-temporal; Feature fusion; Keyframe selection; CONTEXT;
DOI
10.1007/978-981-97-8511-7_38
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
To address the challenge of capturing features effectively in contemporary video tasks, we propose an action recognition approach based on keyframe filtering and feature fusion. Our method comprises two core modules. The keyframe screening module employs an attention mechanism to split the input depth feature map sequence into two distinct tensors, reducing redundant spatial computation and improving the capture of key features. The spatio-temporal and action feature module has two branches with different structures, which perform spatio-temporal and action feature extraction on the differentiated features from the previous module. Through these closely linked modules, our approach identifies and extracts meaningful video features for subsequent classification. We build an end-to-end deep learning model using established frameworks, train and validate it on a generic video dataset, and confirm its efficacy through comparison and ablation experiments. Experiments on this dataset show that our model surpasses the majority of prior works.
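The idea of splitting a frame-feature sequence into two tensors by an attention score can be illustrated with a minimal sketch. This is not the authors' module: the abstract does not specify how the attention is computed, so the scoring rule (a softmax over each frame's mean activation), the top-k cutoff, and the names `frame_scores` and `split_keyframes` are all assumptions made for illustration.

```python
import math

def frame_scores(features):
    """Softmax over each frame's mean activation.

    Assumption: this stands in for a learned attention score,
    which the abstract does not describe.
    """
    means = [sum(f) / len(f) for f in features]
    peak = max(means)  # subtract the max for numerical stability
    exps = [math.exp(m - peak) for m in means]
    total = sum(exps)
    return [e / total for e in exps]

def split_keyframes(features, k):
    """Split a frame-feature sequence into (key, rest).

    The k highest-scoring frames form the key group; both groups
    keep their original temporal order.
    """
    scores = frame_scores(features)
    ranked = sorted(range(len(features)), key=lambda i: scores[i], reverse=True)
    key_idx = sorted(ranked[:k])
    key_set = set(key_idx)
    rest_idx = [i for i in range(len(features)) if i not in key_set]
    return [features[i] for i in key_idx], [features[i] for i in rest_idx]

# Toy sequence: 4 frames with 3-dimensional features each (hypothetical data).
frames = [[0.1, 0.2, 0.3], [0.9, 0.8, 1.0], [0.0, 0.1, 0.0], [0.5, 0.6, 0.7]]
key, rest = split_keyframes(frames, k=2)
```

In this toy input the second and fourth frames have the largest mean activations, so they form the key group and the remaining two frames form the other group. In a real model each group would presumably be a tensor passed to its own branch, as the abstract describes.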
Pages: 540-553
Page count: 14