A Video Classification Method Based on Spatiotemporal Detail Attention and Feature Fusion

被引：0

作者：

Gong, Xuchao ^{[1
]}

Li, Zongmin ^{[1
]}

机构：

[1] China Univ Petr East China, Sch Comp Sci & Technol, Qingdao 266580, Peoples R China

来源：

MOBILE INFORMATION SYSTEMS | 2022年 / 2022卷

关键词：

D O I：

10.1155/2022/4213335

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the explosive growth of Internet video data, demands for accurate large-scale video classification and management are increasing. In the real-world deployment, the balance between effectiveness and timeliness should be fully considered. Generally, the video classification algorithm equipped with time segment network is used in industrial deployment, and the frame extraction feature is used to classify video actions However, the issue of semantic deviation will be raised due to coarse feature description. In this paper, we propose a novel method, called image dense feature and internal significant detail description, to enhance the generalization and discrimination of feature description. Specifically, the location information layer of space-time geometric relationship is added to effectively engrave the local features of convolution layer. Moreover, the multimodal feature graph network is introduced to effectively improve the generalization ability of feature fusion. Extensive experiments show that the proposed method can effectively improve the results on two commonly used benchmarks (kinetics 400 and kinetics 600).

引用

页数：10

共 50 条

[1] Road Crack Model Based on Multi-Level Feature Fusion and Attention Mechanism
Song, Rongrong
Wang, Caiyong
Tian, Qichuan
Zhang, Qi
Computer Engineering and Applications, 2023, 59 (13): : 281 - 288
[2] Silent liveness detection algorithm based on multi classification and feature fusion network
Huang, Xin-Yu
You, Fan
Zhang, Pei
Zhang, Zhao
Zhang, Bai-Li
Lv, Jian-Hua
Xu, Li-Zhen
Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2022, 56 (02): : 263 - 270
[3] Semantic Segmentation of Remote-Sensing Images Based on Multiscale Feature Fusion and Attention Refinement
He, Xin
Zhou, Yong
Zhao, Jiaqi
Zhang, Man
Yao, Rui
Liu, Bing
Li, Haichao
IEEE Geoscience and Remote Sensing Letters, 2022, 19
[4] Facial Expression Image Classification Based on Multi-scale Feature Fusion Residual Network
Zhao, Yuxi
Wang, Chunzhi
Zhou, Xianjing
Liu, Hu
Communications in Computer and Information Science, 2023, 1811 CCIS : 105 - 118
[5] Identification method of corner reflector based on polarization and HRRP feature fusion for radar seeker
Han, Jingwen
Yang, Yong
Lian, Jing
Wu, Guoqing
Wang, Xuesong
Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 46 (11): : 3658 - 3670
[6] Efficient Attention Fusion Feature Extraction Network for Image Super-Resolution
Wang, Tuoran
Cheng, Na
Ding, Shijia
Wang, Hongyu
ACM International Conference Proceeding Series, 2023, : 35 - 44
[7] Feature Fusion Method for Low-Illumination Images
Li, Ya-Nan
Zhang, Zhen-Feng
Chen, Yi-Fan
Huang, Chu-Hua
Journal of Computers (Taiwan), 2022, 33 (06) : 167 - 180
[8] Video Salient Object Detection Via Spatiotemporal Co-Attention and Global Structural Dependence
Liu, Bing
Wang, Tiantian
Gao, Lina
Yan, Zheng
Xu, Mingzhu
SSRN, 2023,
[9] Malicious URL Detection Based on Multiple Feature Fusion
Wu, Sen-Yan
Luo, Xi
Wang, Wei-Ping
Qin, Yan
Ruan Jian Xue Bao/Journal of Software, 2021, 32 (09): : 2916 - 2934
[10] A Point Cloud Classification Method and Its Applications Based on Multi-Head Self-Attention
Liu, Xue-Jun
Wang, Wen-Hui
Yan, Yong
Cui, Zhong-Ji
Sha, Yun
Jiang, Yi-Nan
Journal of Computers (Taiwan), 2023, 34 (04) : 163 - 173

← 1 2 3 4 5 →