Zero-Shot Video Grounding With Pseudo Query Lookup and Verification

Cited by: 5
Authors
Lu, Yu [1]
Quan, Ruijie [2]
Zhu, Linchao [2]
Yang, Yi [2]
Affiliations
[1] Univ Technol Sydney, Australian Artificial Intelligence Inst, ReLER Lab, Ultimo, NSW 2007, Australia
[2] Zhejiang Univ, Coll Comp Sci & Technol, CCAI, Hangzhou 310027, Peoples R China
Funding
Australian Research Council
Keywords
Grounding; Detectors; Proposals; Training; Task analysis; Visualization; Semantics; Video grounding; zero-shot learning; vision and language; NETWORK; LOCALIZATION;
DOI
10.1109/TIP.2024.3365249
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Video grounding, the process of identifying a specific moment in an untrimmed video based on a natural language query, has become a popular topic in video understanding. However, fully supervised learning approaches for video grounding that require large amounts of annotated data can be expensive and time-consuming. Recently, zero-shot video grounding (ZS-VG) methods that leverage pre-trained object detectors and language models to generate pseudo-supervision for training video grounding models have been developed. However, these approaches have limitations in recognizing diverse categories and capturing specific dynamics and interactions in the video context. To tackle these challenges, we introduce a novel two-stage ZS-VG framework called Lookup-and-Verification (LoVe), which treats the pseudo-query generation procedure as a video-to-concept retrieval problem. Our approach allows for the extraction of diverse concepts from an open-concept pool and employs a verification process to ensure the relevance of the retrieved concepts to the objects or events of interest in the video proposals. Comprehensive experimental results on the Charades-STA, ActivityNet-Captions, and DiDeMo datasets demonstrate the effectiveness of the LoVe framework.
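The lookup-then-verify idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the toy embeddings, the function names `lookup` and `verify`, the cosine-similarity ranking, and the margin-based verification rule are all assumptions chosen for illustration. It only shows the general shape of retrieving candidate concepts for a video proposal from a concept pool and then discarding candidates that are not specific to that proposal.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def lookup(proposal_emb, concept_pool, top_k=3):
    """Lookup step (assumed form): rank pool concepts by similarity to the proposal."""
    scored = [(name, cosine(proposal_emb, emb)) for name, emb in concept_pool.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


def verify(candidates, concept_pool, video_emb, margin=0.1):
    """Verification step (hypothetical rule): keep a concept only if it matches
    the proposal better than it matches the whole video, by at least `margin`."""
    kept = []
    for name, proposal_score in candidates:
        video_score = cosine(video_emb, concept_pool[name])
        if proposal_score - video_score >= margin:
            kept.append(name)
    return kept


# Toy 3-dimensional embeddings; a real system would use a pre-trained
# vision-language encoder, which is outside the scope of this sketch.
concept_pool = {
    "person opens door": [0.9, 0.1, 0.0],
    "dog": [0.0, 1.0, 0.0],
    "cooking": [0.1, 0.0, 0.9],
    "door": [0.8, 0.3, 0.1],
}
proposal_emb = [1.0, 0.2, 0.0]  # embedding of one candidate video segment
video_emb = [0.5, 0.5, 0.5]     # embedding of the full untrimmed video

candidates = lookup(proposal_emb, concept_pool, top_k=3)
verified = verify(candidates, concept_pool, video_emb, margin=0.1)
print(verified)
```

In this toy setup, the verified concepts would serve as the pseudo query paired with the proposal's time span; how the paper actually composes queries from retrieved concepts is not stated in the abstract.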
Pages: 1643-1654
Page count: 12