Micro-Video Event Detection Based on Deep Dynamic Semantic Correlation

被引:0
|
作者
Jing Peiguang [1 ]
Song Xiaoyi [1 ]
Su Yuting [1 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
关键词
micro-video; semantic correlation; feature representation; graph convolution;
D O I
10.3788/LOP230994
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Nowadays, micro-video event detection exhibits great potential for various applications. As for event detection, previous studies usually ignore the importance of keyframes and mostly focus on the exploration of explicit attributes of events. They neglect the exploration of latent semantic representations and their relationships. Aiming at the above problems, a deep dynamic semantic correlation method is proposed for micro-video event detection. First, the frame importance evaluation module is designed to obtain more distinguishing scores of keyframes, in which the joint structure of variational autoencoder and generative adversarial network can strengthen the importance of information to the greatest extent. Then, the intrinsic correlations between keyframes and the corresponding features are cooperated through a keyframe-guided self-attention mechanism. Finally, the hidden event attribute correlation module based on dynamic graph convolution is designed to learn latent semantics and the corresponding correlation patterns of events. The obtained latent semantic-aware representations are used for final micro-video event detection. Experiments performed on the public datasets and the newly constructed micro-video event detection dataset demonstrate the effectiveness of the proposed method.
引用
收藏
页数:10
相关论文
共 38 条
  • [1] Iktishaf: a Big Data Road-Traffic Event Detection Tool Using Twitter and Spark Machine Learning
    Alomari, Ebtesam
    Katib, Iyad
    Mehmood, Rashid
    [J]. MOBILE NETWORKS & APPLICATIONS, 2023, 28 (02): : 603 - 618
  • [2] Bastings Joost, 2017, P 2017 C EMP METH NA, P1957, DOI DOI 10.48550/ARXIV.1806.08804
  • [3] Knowledge-Preserving Incremental Social Event Detection via Heterogeneous GNNs
    Cao, Yuwei
    Peng, Hao
    Wu, Jia
    Dou, Yingtong
    Li, Jianxin
    Yu, Philip S.
    [J]. PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 3383 - 3395
  • [4] Semantic Pooling for Complex Event Analysis in Untrimmed Videos
    Chang, Xiaojun
    Yu, Yao-Liang
    Yang, Yi
    Xing, Eric P.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (08) : 1617 - 1632
  • [5] Implicit Rating Methods Based on Interest Preferences of Categories for Micro-Video Recommendation
    Chen, Jie
    Peng, Junjie
    Qi, Lizhe
    Chen, Gan
    Zhang, Wenqiang
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT I, 2019, 11775 : 371 - 381
  • [6] Multi-Label Image Recognition with Graph Convolutional Networks
    Chen, Zhao-Min
    Wei, Xiu-Shen
    Wang, Peng
    Guo, Yanwen
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5172 - 5181
  • [7] Chu G H, 2023, Acta Optica, V43, P264
  • [8] Learning Spatiotemporal Features with 3D Convolutional Networks
    Du Tran
    Bourdev, Lubomir
    Fergus, Rob
    Torresani, Lorenzo
    Paluri, Manohar
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4489 - 4497
  • [9] X3D: Expanding Architectures for Efficient Video Recognition
    Feichtenhofer, Christoph
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 200 - 210
  • [10] Gated-ViGAT: Efficient Bottom-Up Event Recognition and Explanation Using a New Frame Selection Policy and Gating Mechanism
    Gkalelis, Nikolaos
    Daskalakis, Dimitrios
    Mezaris, Vasileios
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2022, : 113 - 120