A Video Classification Method Based on Spatiotemporal Detail Attention and Feature Fusion

被引:0
|
作者
Gong, Xuchao [1 ]
Li, Zongmin [1 ]
机构
[1] China Univ Petr East China, Sch Comp Sci & Technol, Qingdao 266580, Peoples R China
关键词
Compilation and indexing terms; Copyright 2024 Elsevier Inc;
D O I
10.1155/2022/4213335
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the explosive growth of Internet video data, demands for accurate large-scale video classification and management are increasing. In the real-world deployment, the balance between effectiveness and timeliness should be fully considered. Generally, the video classification algorithm equipped with time segment network is used in industrial deployment, and the frame extraction feature is used to classify video actions However, the issue of semantic deviation will be raised due to coarse feature description. In this paper, we propose a novel method, called image dense feature and internal significant detail description, to enhance the generalization and discrimination of feature description. Specifically, the location information layer of space-time geometric relationship is added to effectively engrave the local features of convolution layer. Moreover, the multimodal feature graph network is introduced to effectively improve the generalization ability of feature fusion. Extensive experiments show that the proposed method can effectively improve the results on two commonly used benchmarks (kinetics 400 and kinetics 600).
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Road Crack Model Based on Multi-Level Feature Fusion and Attention Mechanism
    Song, Rongrong
    Wang, Caiyong
    Tian, Qichuan
    Zhang, Qi
    Computer Engineering and Applications, 2023, 59 (13): : 281 - 288
  • [2] Silent liveness detection algorithm based on multi classification and feature fusion network
    Huang, Xin-Yu
    You, Fan
    Zhang, Pei
    Zhang, Zhao
    Zhang, Bai-Li
    Lv, Jian-Hua
    Xu, Li-Zhen
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2022, 56 (02): : 263 - 270
  • [3] Semantic Segmentation of Remote-Sensing Images Based on Multiscale Feature Fusion and Attention Refinement
    He, Xin
    Zhou, Yong
    Zhao, Jiaqi
    Zhang, Man
    Yao, Rui
    Liu, Bing
    Li, Haichao
    IEEE Geoscience and Remote Sensing Letters, 2022, 19
  • [4] Facial Expression Image Classification Based on Multi-scale Feature Fusion Residual Network
    Zhao, Yuxi
    Wang, Chunzhi
    Zhou, Xianjing
    Liu, Hu
    Communications in Computer and Information Science, 2023, 1811 CCIS : 105 - 118
  • [5] Identification method of corner reflector based on polarization and HRRP feature fusion for radar seeker
    Han, Jingwen
    Yang, Yong
    Lian, Jing
    Wu, Guoqing
    Wang, Xuesong
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 46 (11): : 3658 - 3670
  • [6] Efficient Attention Fusion Feature Extraction Network for Image Super-Resolution
    Wang, Tuoran
    Cheng, Na
    Ding, Shijia
    Wang, Hongyu
    ACM International Conference Proceeding Series, 2023, : 35 - 44
  • [7] Feature Fusion Method for Low-Illumination Images
    Li, Ya-Nan
    Zhang, Zhen-Feng
    Chen, Yi-Fan
    Huang, Chu-Hua
    Journal of Computers (Taiwan), 2022, 33 (06) : 167 - 180
  • [8] Video Salient Object Detection Via Spatiotemporal Co-Attention and Global Structural Dependence
    Liu, Bing
    Wang, Tiantian
    Gao, Lina
    Yan, Zheng
    Xu, Mingzhu
    SSRN, 2023,
  • [9] Malicious URL Detection Based on Multiple Feature Fusion
    Wu, Sen-Yan
    Luo, Xi
    Wang, Wei-Ping
    Qin, Yan
    Ruan Jian Xue Bao/Journal of Software, 2021, 32 (09): : 2916 - 2934
  • [10] A Point Cloud Classification Method and Its Applications Based on Multi-Head Self-Attention
    Liu, Xue-Jun
    Wang, Wen-Hui
    Yan, Yong
    Cui, Zhong-Ji
    Sha, Yun
    Jiang, Yi-Nan
    Journal of Computers (Taiwan), 2023, 34 (04) : 163 - 173