Micro-Video Event Detection Based on Deep Dynamic Semantic Correlation

被引：0

作者：

Jing Peiguang ^{[1
]}

Song Xiaoyi ^{[1
]}

Su Yuting ^{[1
]}

机构：

[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China

来源：

LASER & OPTOELECTRONICS PROGRESS | 2024年 / 61卷 / 04期

关键词：

micro-video; semantic correlation; feature representation; graph convolution;

D O I：

10.3788/LOP230994

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Nowadays, micro-video event detection exhibits great potential for various applications. As for event detection, previous studies usually ignore the importance of keyframes and mostly focus on the exploration of explicit attributes of events. They neglect the exploration of latent semantic representations and their relationships. Aiming at the above problems, a deep dynamic semantic correlation method is proposed for micro-video event detection. First, the frame importance evaluation module is designed to obtain more distinguishing scores of keyframes, in which the joint structure of variational autoencoder and generative adversarial network can strengthen the importance of information to the greatest extent. Then, the intrinsic correlations between keyframes and the corresponding features are cooperated through a keyframe-guided self-attention mechanism. Finally, the hidden event attribute correlation module based on dynamic graph convolution is designed to learn latent semantics and the corresponding correlation patterns of events. The obtained latent semantic-aware representations are used for final micro-video event detection. Experiments performed on the public datasets and the newly constructed micro-video event detection dataset demonstrate the effectiveness of the proposed method.

引用

页数：10

共 38 条

[1] Iktishaf: a Big Data Road-Traffic Event Detection Tool Using Twitter and Spark Machine Learning
Alomari, Ebtesam
Katib, Iyad
Mehmood, Rashid
[J]. MOBILE NETWORKS & APPLICATIONS, 2023, 28 (02): : 603 - 618
[2] Bastings Joost, 2017, P 2017 C EMP METH NA, P1957, DOI DOI 10.48550/ARXIV.1806.08804
[3] Knowledge-Preserving Incremental Social Event Detection via Heterogeneous GNNs
Cao, Yuwei
Peng, Hao
Wu, Jia
Dou, Yingtong
Li, Jianxin
Yu, Philip S.
[J]. PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 3383 - 3395
[4] Semantic Pooling for Complex Event Analysis in Untrimmed Videos
Chang, Xiaojun
Yu, Yao-Liang
Yang, Yi
Xing, Eric P.
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (08) : 1617 - 1632
[5] Implicit Rating Methods Based on Interest Preferences of Categories for Micro-Video Recommendation
Chen, Jie
Peng, Junjie
Qi, Lizhe
Chen, Gan
Zhang, Wenqiang
[J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT I, 2019, 11775 : 371 - 381
[6] Multi-Label Image Recognition with Graph Convolutional Networks
Chen, Zhao-Min
Wei, Xiu-Shen
Wang, Peng
Guo, Yanwen
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5172 - 5181
[7] Chu G H, 2023, Acta Optica, V43, P264
[8] Learning Spatiotemporal Features with 3D Convolutional Networks
Du Tran
Bourdev, Lubomir
Fergus, Rob
Torresani, Lorenzo
Paluri, Manohar
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4489 - 4497
[9] X3D: Expanding Architectures for Efficient Video Recognition
Feichtenhofer, Christoph
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 200 - 210
[10] Gated-ViGAT: Efficient Bottom-Up Event Recognition and Explanation Using a New Frame Selection Policy and Gating Mechanism
Gkalelis, Nikolaos
Daskalakis, Dimitrios
Mezaris, Vasileios
[J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2022, : 113 - 120

← 1 2 3 4 →