A Video Classification Method Based on Spatiotemporal Detail Attention and Feature Fusion

被引:0
|
作者
Gong, Xuchao [1 ]
Li, Zongmin [1 ]
机构
[1] China Univ Petr East China, Sch Comp Sci & Technol, Qingdao 266580, Peoples R China
关键词
Compilation and indexing terms; Copyright 2024 Elsevier Inc;
D O I
10.1155/2022/4213335
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the explosive growth of Internet video data, demands for accurate large-scale video classification and management are increasing. In the real-world deployment, the balance between effectiveness and timeliness should be fully considered. Generally, the video classification algorithm equipped with time segment network is used in industrial deployment, and the frame extraction feature is used to classify video actions However, the issue of semantic deviation will be raised due to coarse feature description. In this paper, we propose a novel method, called image dense feature and internal significant detail description, to enhance the generalization and discrimination of feature description. Specifically, the location information layer of space-time geometric relationship is added to effectively engrave the local features of convolution layer. Moreover, the multimodal feature graph network is introduced to effectively improve the generalization ability of feature fusion. Extensive experiments show that the proposed method can effectively improve the results on two commonly used benchmarks (kinetics 400 and kinetics 600).
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Defect detection of jacquard knitted fabrics based on nonlinear diffusion and multi-feature fusion
    Shi, Weimin
    Jian, Qiang
    Li, Jianqiang
    Ru, Xin
    Peng, Laihu
    Fangzhi Xuebao/Journal of Textile Research, 2023, 44 (07): : 86 - 94
  • [42] Network traffic prediction based on feature fusion spatio-temporal graph convolutional network
    Key Laboratory of Universal Wireless Communications, Ministry of Education, Beijing University of Posts and Telecommunications, Beijing
    100876, China
    不详
    100876, China
    Proc SPIE Int Soc Opt Eng,
  • [43] Driver Road Rage Recognition Based on Improved MFCC Fusion Feature and FA-PNN
    Li, Shangqing
    Wang, Xiaoyuan
    Zhang, Yang
    Li, Hao
    Xiang, Hui
    Computer Engineering and Applications, 2024, 59 (02): : 306 - 313
  • [44] Mosaic and repair method of locust slices based on feature extraction and matching
    College of Information and Electrical Engineering, China Agricultural University, Beijing
    100083, China
    Nongye Gongcheng Xuebao, 7 (157-165):
  • [45] Chaff identification method based on Range-Doppler imaging feature
    Wang, Husheng
    Chen, Baixiao
    Zhu, Dongchen
    Huang, Fengsheng
    Yu, Xiangzhen
    Ye, Qingzhi
    Cheng, Xiancheng
    Peng, Shuai
    Jing, Jiaqiu
    IET Radar, Sonar and Navigation, 2022, 16 (11): : 1861 - 1871
  • [46] Subway Platform Passenger Flow Counting Algorithm Based on Feature-Enhanced Pyramid and Mixed Attention
    Zuo, Jing
    Liu, Guoyan
    Yu, Zhao
    JOURNAL OF ADVANCED TRANSPORTATION, 2023, 2023
  • [47] Fault detection and diagnosis method based on weighted statistical feature KICA
    Zhang, Cheng
    Pan, Lizhi
    Li, Yuan
    Huagong Xuebao/CIESC Journal, 2022, 73 (02): : 827 - 837
  • [48] Improving the faithfulness of attention-based explanations with task-specific information for text classification
    Chrysostomou, George
    Aletras, Nikolaos
    arXiv, 2021,
  • [49] Object Detection Algorithm with Dual-Modal Rectification Fusion Based on Self-Guided Attention
    Zhang, Jinglei
    Gong, Wenhao
    Jia, Xin
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2023, 36 (09): : 793 - 805
  • [50] A Transformer-based Multi-modal Joint Attention Fusion Model for Molecular Property Prediction
    Wang, Ke
    Zhang, Wei
    Liu, Yong
    Proceedings - 2023 2023 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2023, 2023, : 4972 - 4974