A Video Classification Method Based on Spatiotemporal Detail Attention and Feature Fusion

被引：0

作者：

Gong, Xuchao ^{[1
]}

Li, Zongmin ^{[1
]}

机构：

[1] China Univ Petr East China, Sch Comp Sci & Technol, Qingdao 266580, Peoples R China

来源：

MOBILE INFORMATION SYSTEMS | 2022年 / 2022卷

关键词：

D O I：

10.1155/2022/4213335

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the explosive growth of Internet video data, demands for accurate large-scale video classification and management are increasing. In the real-world deployment, the balance between effectiveness and timeliness should be fully considered. Generally, the video classification algorithm equipped with time segment network is used in industrial deployment, and the frame extraction feature is used to classify video actions However, the issue of semantic deviation will be raised due to coarse feature description. In this paper, we propose a novel method, called image dense feature and internal significant detail description, to enhance the generalization and discrimination of feature description. Specifically, the location information layer of space-time geometric relationship is added to effectively engrave the local features of convolution layer. Moreover, the multimodal feature graph network is introduced to effectively improve the generalization ability of feature fusion. Extensive experiments show that the proposed method can effectively improve the results on two commonly used benchmarks (kinetics 400 and kinetics 600).

引用

页数：10

共 50 条

[31] Research on Real-Time Motion Classification and Counting Algorithm Based on Video
Ke, Mengyun
Ma, Zhuang
Wang, Chongwen
Proceedings - 2022 IEEE SmartWorld, Ubiquitous Intelligence and Computing, Autonomous and Trusted Vehicles, Scalable Computing and Communications, Digital Twin, Privacy Computing, Metaverse, SmartWorld/UIC/ATC/ScalCom/DigitalTwin/PriComp/Metaverse 2022, 2022, : 433 - 440
[32] Multi-Feature Fusion Based Radar Seeker Against Corner Reflector Interference
Tang, Tianxiang
Qiu, Linmao
Li, Chen
Zhang, Jiadong
2024 5th International Conference on Machine Learning and Computer Application, ICMLCA 2024, 2024, : 566 - 570
[33] Face spoofing detection model based on multi-scale predictive feature fusion
Huang, Ling
He, Xi Ping
He, Dan
Proceedings of SPIE - The International Society for Optical Engineering, 2023, 12707
[34] DeepLabv3+ Lightweight Image Segmentation Algorithm Based on Multilevel Feature Fusion
Zhou, Huaping
Deng, Bin
Computer Engineering and Applications, 60 (16): : 269 - 275
[35] RGB-D Saliency Detection Based on Multi-Level Feature Fusion
Shi, Yue
Yu, Wanjun
Chen, Ying
Computer Engineering and Applications, 2023, 59 (07): : 207 - 213
[36] Extraction method of shape feature for vegetables based on depth image
Li, Changyong
Cao, Qixin
Nongye Jixie Xuebao/Transactions of the Chinese Society of Agricultural Machinery, 2012, 43 (SUPPL.1): : 242 - 245
[37] Automated segmentation of skin lesion based on multi-scale feature extraction and attention mechanism
College of Intelligence and Information Engineering, Shandong University of Traditional Chinese Medicine, Jinan
250355, China
TechRxiv,
[38] A throughput-based classification method for GPU programs
Hu, Zhi-Dan
Liu, Guang-Ming
Dong, Wen-Rui
Dongbei Daxue Xuebao/Journal of Northeastern University, 2014, 35 : 341 - 346
[39] Video Anomaly Detection Based on Optical Flow Feature Enhanced Spatio-Temporal Feature Network FusionNet-LSTM-G
Song, Jun-Fang
Zhao, Hai-Li
Wen, Duo-Yang
Xu, Xiao-Yu
IEEE Access, 2022, 10 : 130314 - 130325
[40] Hyperspectral image classification with localized spectral filtering-based graph attention network
Pu, S.
Song, Y.
Chen, Y.
Li, Y.
Zhang, J.
Lin, Q.
Zhu, X.
Chen, Y.
Zeng, H.
Liao, K.
Yu, H.
Yuan, J.
Yu, S.
Zuo, W.
ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, 2022, 5 (03) : 155 - 161

← 1 2 3 4 5 →