A Video Classification Method Based on Spatiotemporal Detail Attention and Feature Fusion

被引：0

作者：

Gong, Xuchao ^{[1
]}

Li, Zongmin ^{[1
]}

机构：

[1] China Univ Petr East China, Sch Comp Sci & Technol, Qingdao 266580, Peoples R China

来源：

MOBILE INFORMATION SYSTEMS | 2022年 / 2022卷

关键词：

D O I：

10.1155/2022/4213335

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the explosive growth of Internet video data, demands for accurate large-scale video classification and management are increasing. In the real-world deployment, the balance between effectiveness and timeliness should be fully considered. Generally, the video classification algorithm equipped with time segment network is used in industrial deployment, and the frame extraction feature is used to classify video actions However, the issue of semantic deviation will be raised due to coarse feature description. In this paper, we propose a novel method, called image dense feature and internal significant detail description, to enhance the generalization and discrimination of feature description. Specifically, the location information layer of space-time geometric relationship is added to effectively engrave the local features of convolution layer. Moreover, the multimodal feature graph network is introduced to effectively improve the generalization ability of feature fusion. Extensive experiments show that the proposed method can effectively improve the results on two commonly used benchmarks (kinetics 400 and kinetics 600).

引用

页数：10

共 50 条

[41] Defect detection of jacquard knitted fabrics based on nonlinear diffusion and multi-feature fusion
Shi, Weimin
Jian, Qiang
Li, Jianqiang
Ru, Xin
Peng, Laihu
Fangzhi Xuebao/Journal of Textile Research, 2023, 44 (07): : 86 - 94
[42] Network traffic prediction based on feature fusion spatio-temporal graph convolutional network
Key Laboratory of Universal Wireless Communications, Ministry of Education, Beijing University of Posts and Telecommunications, Beijing
100876, China
不详
100876, China
Proc SPIE Int Soc Opt Eng,
[43] Driver Road Rage Recognition Based on Improved MFCC Fusion Feature and FA-PNN
Li, Shangqing
Wang, Xiaoyuan
Zhang, Yang
Li, Hao
Xiang, Hui
Computer Engineering and Applications, 2024, 59 (02): : 306 - 313
[44] Mosaic and repair method of locust slices based on feature extraction and matching
College of Information and Electrical Engineering, China Agricultural University, Beijing
100083, China
Nongye Gongcheng Xuebao, 7 (157-165):
[45] Chaff identification method based on Range-Doppler imaging feature
Wang, Husheng
Chen, Baixiao
Zhu, Dongchen
Huang, Fengsheng
Yu, Xiangzhen
Ye, Qingzhi
Cheng, Xiancheng
Peng, Shuai
Jing, Jiaqiu
IET Radar, Sonar and Navigation, 2022, 16 (11): : 1861 - 1871
[46] Subway Platform Passenger Flow Counting Algorithm Based on Feature-Enhanced Pyramid and Mixed Attention
Zuo, Jing
Liu, Guoyan
Yu, Zhao
JOURNAL OF ADVANCED TRANSPORTATION, 2023, 2023
[47] Fault detection and diagnosis method based on weighted statistical feature KICA
Zhang, Cheng
Pan, Lizhi
Li, Yuan
Huagong Xuebao/CIESC Journal, 2022, 73 (02): : 827 - 837
[48] Improving the faithfulness of attention-based explanations with task-specific information for text classification
Chrysostomou, George
Aletras, Nikolaos
arXiv, 2021,
[49] Object Detection Algorithm with Dual-Modal Rectification Fusion Based on Self-Guided Attention
Zhang, Jinglei
Gong, Wenhao
Jia, Xin
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2023, 36 (09): : 793 - 805
[50] A Transformer-based Multi-modal Joint Attention Fusion Model for Molecular Property Prediction
Wang, Ke
Zhang, Wei
Liu, Yong
Proceedings - 2023 2023 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2023, 2023, : 4972 - 4974

← 1 2 3 4 5 →