A Video Classification Method Based on Spatiotemporal Detail Attention and Feature Fusion

被引:0
|
作者
Gong, Xuchao [1 ]
Li, Zongmin [1 ]
机构
[1] China Univ Petr East China, Sch Comp Sci & Technol, Qingdao 266580, Peoples R China
关键词
Compilation and indexing terms; Copyright 2024 Elsevier Inc;
D O I
10.1155/2022/4213335
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the explosive growth of Internet video data, demands for accurate large-scale video classification and management are increasing. In the real-world deployment, the balance between effectiveness and timeliness should be fully considered. Generally, the video classification algorithm equipped with time segment network is used in industrial deployment, and the frame extraction feature is used to classify video actions However, the issue of semantic deviation will be raised due to coarse feature description. In this paper, we propose a novel method, called image dense feature and internal significant detail description, to enhance the generalization and discrimination of feature description. Specifically, the location information layer of space-time geometric relationship is added to effectively engrave the local features of convolution layer. Moreover, the multimodal feature graph network is introduced to effectively improve the generalization ability of feature fusion. Extensive experiments show that the proposed method can effectively improve the results on two commonly used benchmarks (kinetics 400 and kinetics 600).
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Research on Real-Time Motion Classification and Counting Algorithm Based on Video
    Ke, Mengyun
    Ma, Zhuang
    Wang, Chongwen
    Proceedings - 2022 IEEE SmartWorld, Ubiquitous Intelligence and Computing, Autonomous and Trusted Vehicles, Scalable Computing and Communications, Digital Twin, Privacy Computing, Metaverse, SmartWorld/UIC/ATC/ScalCom/DigitalTwin/PriComp/Metaverse 2022, 2022, : 433 - 440
  • [32] Multi-Feature Fusion Based Radar Seeker Against Corner Reflector Interference
    Tang, Tianxiang
    Qiu, Linmao
    Li, Chen
    Zhang, Jiadong
    2024 5th International Conference on Machine Learning and Computer Application, ICMLCA 2024, 2024, : 566 - 570
  • [33] Face spoofing detection model based on multi-scale predictive feature fusion
    Huang, Ling
    He, Xi Ping
    He, Dan
    Proceedings of SPIE - The International Society for Optical Engineering, 2023, 12707
  • [34] DeepLabv3+ Lightweight Image Segmentation Algorithm Based on Multilevel Feature Fusion
    Zhou, Huaping
    Deng, Bin
    Computer Engineering and Applications, 60 (16): : 269 - 275
  • [35] RGB-D Saliency Detection Based on Multi-Level Feature Fusion
    Shi, Yue
    Yu, Wanjun
    Chen, Ying
    Computer Engineering and Applications, 2023, 59 (07): : 207 - 213
  • [36] Extraction method of shape feature for vegetables based on depth image
    Li, Changyong
    Cao, Qixin
    Nongye Jixie Xuebao/Transactions of the Chinese Society of Agricultural Machinery, 2012, 43 (SUPPL.1): : 242 - 245
  • [37] Automated segmentation of skin lesion based on multi-scale feature extraction and attention mechanism
    College of Intelligence and Information Engineering, Shandong University of Traditional Chinese Medicine, Jinan
    250355, China
    TechRxiv,
  • [38] A throughput-based classification method for GPU programs
    Hu, Zhi-Dan
    Liu, Guang-Ming
    Dong, Wen-Rui
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2014, 35 : 341 - 346
  • [39] Video Anomaly Detection Based on Optical Flow Feature Enhanced Spatio-Temporal Feature Network FusionNet-LSTM-G
    Song, Jun-Fang
    Zhao, Hai-Li
    Wen, Duo-Yang
    Xu, Xiao-Yu
    IEEE Access, 2022, 10 : 130314 - 130325
  • [40] Hyperspectral image classification with localized spectral filtering-based graph attention network
    Pu, S.
    Song, Y.
    Chen, Y.
    Li, Y.
    Zhang, J.
    Lin, Q.
    Zhu, X.
    Chen, Y.
    Zeng, H.
    Liao, K.
    Yu, H.
    Yuan, J.
    Yu, S.
    Zuo, W.
    ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, 2022, 5 (03) : 155 - 161